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$1. InTRODUCTION 


THE purpose of this article is to study one of the ways in which electricity 
can flow in a semiconductor containing impurities. The current in an 
impure semiconductor is due to two competing conduction processes which 
act in parallel. The first process is responsible for the current usually 
observed which (in for example n-type material) is carried by electrons in 
the conduction band in thermal equilibrium with electrons on donor 
impurities. The second process arises as follows: An electron occupying an 
isolated donor has a wave function localized about the impurity and an 
energy slightly below the conduction band minimum. Because there is a 
small but finite overlap of the wave function of an electron on one donor 
with neighbouring donors, a conduction process is possible in certain cir- 
cumstances in which the electron moves between centres by tunnel effect 
without activation into the conduction band. This we call impurity 
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conduction. The electrons of high mobility in the conduction band com- 
pletely dominate the conductivity at higher temperatures. However, 
although the mobility of an electron moving in the impurity levels is very 
small since it depends on interaction between widely spaced impurities, at 
low temperatures impurity conduction will dominate due to the absence of 
electrons in the conduction band. 

The circumstance in which impurity conduction is possible is the presence 
of ‘compensation’, by which we mean the presence of some minority 
centres, acceptors in an n-type conductor. These accept electrons from a 
certain proportion of the donors, thus allowing the movement of electrons 
from an occupied donor into an occupied one (fig. 1). Without compensa- 
tion impurity conduction is not possible, unless the overlap between the 
centres is very large; when this is large enough, corresponding to a critical 
concentration N,, another form of conductivity sets in, in which the 
electrons behave like a degenerate electron gas. This we shall treat in part II 
of this report and shall describe as metallic impurity conduction. 


Fig. 1 
conduction 
band 
6-2 a aoa — ~- donors 
Xe, = acceptors 


CEE ELE SEES 


Energy diagram of an n-type semiconductor containing donors and acceptors. 
The horizontal lines represent centres, the circles electrons in them. 


Impurity conduction} was first observed by Busch and Labhart (1946) 
in silicon carbide, and has since been observed in a large number of both 
n- and p-type semiconductors ; references are givenin§4. The concept of 
an electron bound to a donor centre is complementary to that of a hole 
bound to an acceptor. Hence our discussion, which for convenience is 
based on n-type, can readily be carried over to p-type material. Most 
experimental work has been done on the valence semiconductors, german- 
ium and silicon, and this article is mainly about these. 

A feature of impurity conduction, which distinguishes it from the usual 
semiconduction, is its extreme sensitivity to impurity concentration. For 
example, a change by a factor of 30 in the density of impurities in germanium 
can alter the conductivity by the impurity process by a factor of 107 
(fig. 4), while the corresponding change in the conductivity in the conduction 
band in the exhaustion range of temperatures (i.e. at those when the 
electrons are nearly all in the conduction band) is only of order 28. Another 


+ The possibility of impurity conduction was suggested on theoretical q 
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feature is that, when the impurity concentration is small, the curves 
plotting In p against 1/7 exhibit a finite slope in the temperature range where 
impurity conduction predominates, suggesting that the charge transfer 
between impurity centres must itself be thermally activated. Above the 
critical concentration N, mentioned above, the resistivity becomes 
independent of temperature; the conductivity is then apparently metallic 
and carriers move freely without thermal activation. There is a small 
transition region (a factor of order 4 in the impurity concentration in 
germanium) just below V,, in which the conductivity is non-metallic but 
the slopes of the curves (Inp vs. 1/7’) decrease and finally vanish at the 
critical concentration. A complicated temperature dependence of the 
Hall effect is also observed in this region (fig. 5). 

The theoretical interest of these phenomena is two-fold. First they 
give the opportunity of studying the transition from metallic to non- 
metallic conduction which occurs as the concentration of carriers is 
decreased. It has been postulated by one of us in a number of papers (Mott 
1949, 1952, 1956, 1957, 1961) that as the lattice spacing of an ordered array 
of atoms is increased there should be a sharp transition from a metallic to a 
non-metallic state of the valence electrons. The theoretical treatment of 
this transition is a many-body problem, involving the interactions between 
the electrons. No way is known, except possibly the use of high pressures, 
of changing the interatomic distance of a crystalline array of atoms over a 
large enough range, so that to test this hypothesis we are driven back to a 
study of a disordered array of centres such as occurs in doped germanium 
or silicon. 

The second point of interest is the mobility of an electron in a disordered 
lattice, considered as a one-body problem. For high concentrations giving 
metallic conductivity, we have a problem like that of a liquid metal, but 
with a greater degree of disorder. For low concentrations, it appears that 
the electron moves by a hopping process from one centre to another, inter- 
action with phonons being essential and the concept of a mean free path not 
appropriate. / 

The article therefore divides naturally into two parts. In the first we 
shall be concerned with the experimental observation and calculation of 
impurity conductivity in the region of low concentration. We discuss in 
§ 5 the reasons for using localized states and a phonon-activated hopping 
process in this low concentration region. The effect of compensating 
impurities and disorder is considered, and the theory of the interaction of 
localized carriers with lattice vibrations is traced through from the limits 
of strong coupling (polar semiconductors) to weak coupling (valence semi- 
conductors). Although we describe this process as a one-body problem, we 
must in our applications of Fermi statistics introduce the interaction 
between electrons in the sense that an electron cannot move into an impurity 
centre that is already occupied. The second part of this article will have 
as its theme the interaction between carriers in the impurity centres when 
this becomes large enough to lead to a transition to a metallic form of 
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conductivity. This second section will start from an assumption of what is 
to be expected from a crystalline arrangement of centres and then discuss 
the effect of disorder and compensation. 


Part I 
§ 2. THe MopEL AND GENERAL Discussion or Impurtry ConDUCTION 


If we neglect the dependence on direction of the effective mass, an 
electron occupying an isolated donor can be taken to move in a hydrogen- 
like orbit in the Coulomb field of the donor ion, with a Bohr radius 


Ayg=K (m/m*) ay 


where m* is the effective mass and a,, is the radius of a hydrogen atom 
(0-544). Because of the large dielectric constant « and small effective 
mass ratio m*/m, the orbit may extend over several hundred of the host 
lattice sites. The energy of this state lies slightly below the lowest state 
of the conduction band. A similar description applies to a hole bound to a 
negative acceptor impurity; in this case the energy of the vacant state 
lies slightly above the top of the valence band. A more exact description 
of the impurity states is given in §3, taking into account the dependence of 
m* on direction. 

In an n-type semiconductor (one in which the donor concentration NV 
exceeds the accept or concentration NV ,), at the absolute zero of temperature 
all the acceptors will be occupied and consequently negatively charged. 
The number of donors occupied and therefore neutral is Np—NV, (fig. 1). 
Overlap between wave functions corresponding to neighbouring sites allows 
movement from an occupied to an empty donor without activation into the. 
conduction band. Our study of impurity conduction will therefore be a 
study of transport of electrons in a random lattice from one positively 
charged donor to another and in the field of fixed negatively charged 
acceptors. The host crystal is regarded as a dielectric medium in which 
this random impurity lattice is imbedded; thermal energy is supplied by 
vibrations of the host crystal. 

As already emphasized in the introduction, at high concentrations of 
impurity the resistivity and Hall coefficient become independent of 
temperature at low temperatures, the electrons behaving like a degenerate 
electron gas. This behaviour, illustrated in figs. 4 and 5, is discussed in part 
II. Inthis section we consider low concentrations. There are two particu- 
larly simple cases which we may dicuss. If Ny is the concentration of 
majority centres (say donors) and N, the concentration of minority 
centres (acceptors), these are: 


Case (a). Np>Npy—-N, 


There will then be a small number (Ny—N,) of electrons in the donor 
states, and, owing to the random arrangement of immobile positively 
and negatively charged centres, a random fluctuation in the potential 
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energy from one centre to another. The question then arises, as in all 
these considerations (§5), whether in the absence of lattice vibrations the 
characteristic wave functions (solutions of the Schrédinger equation for a 
single electron) are localized or whether they spread through the lattice. 
By a localized wave function we mean one that decays exponentially to 
zero at large enough distances from a given point in space. Considerations 
set out in § 7 show that in a one-dimensional lattice they are always localized. 
In a three-dimensional lattice they are localized if the degree of disorder, 
or the ratio of the energy in the random field to the band width, are great 
enough (§5). We believe this to be the case in the range of concentration for 
which experimental measurements are made; the electron can then jump 
from one centre to another only with the help of phonons. The process by 
which it does so is a main theme of this report. But itis possible that, as the 
concentration of donors increases, there may be formed unbound states so 
that the activation energy for motion would be zero. It is possible that a 
‘crystallization’ of electrons, as envisaged by Wigner (1938) may occur, and 
a transition to a metallic state (condensed electron gas) only for higher 
concentrations of electrons. This is discussed in part II and by Mott 
(1961). 


Case (6). Np>N, 

In this case most of the donors are occupied and a small number vacant. 
The donor states which are unoccupied are to be thought of as carriers; as 
electrons jump from occupied to unoccupied donors, the positive vacancy 
moves through the lattice. As in case_(a) when the centres are still 
localized, we think of this as a hopping process from centre to centre. 

~ In case (5) it is possible to discuss the activation energy for charge transfer 
in terms of a simple model first suggested by one of us (Mott 1956). The 
carrier (the positive charge vacancy) will in its state of lowest energy lie as 
closely as possible to a negatively charged acceptor. Before conduction 
can occur the carrier must be thermally activated from this bound state. 
An energy of order 


B= (Jc) (I rpp) = 1-46 (e2/e) pt ww. () 


will be required to remove the carrier from the neighbourhood of the 
acceptor. Here rap is the nearest neighbour separation between a donor 
and an acceptor, and (1/r,p) denotes an average, assuming a Poisson 
distribution of centres. Price (1957) suggested as a better approximation 
that # should be of order 


Hae N= CN ay ee eee ee) 


he supposed that one should not consider removing the carrier to infinity 
but to a distance at which it is effectively outside the field of the particular 
acceptor concerned. The further movement of the ‘free’ carriers through 
the lattice may well require further activation, but the energies will be 
considerably smaller. Price has examined the statistics of this model. 
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We give here a simple derivation of the number » of free carriers. Assum- 
ing there is only one trap site associated with each acceptor, and a constant 
trapping energy JZ, the free energy of the carriers is : 


nE—kT [In{Np!/n\(Np—n)!}—In (Ng! /n'(N 4 — 2) HI. 
Minimizing this with respect to n, we obtain 
n®|(Np—n)(N'4—n) = exp (— B/kT) 
nz (Ny N,)* exp (—£/2k7) 
at low temperatures. Thus we expect the slope ¢, of the curve plotting 


Inp against 1/kT' to be approximately $# at low temperatures. Saturation 
in the carrier concentration should set in at temperatures given by 


exp(—H/2kT)~K'2, (K=N,/Np). . . . . (3) 


In, for example, p-type germanium containing 6-5 x10 majority 
impurities per cm? and a compensation ratio K = 0-05, the observed activa- 
tion energy at low temperatures is 1:6 x 10-3ev. Therefore the saturation 
temperature 7’; should be approximately 14°. A slight flattening of the 
resistivity curve is observed at around 5°, the high temperature end of the 
impurity conduction range (fig. 9, specimen (a)). However, this saturation 
effect is prominent only in specimens in which both the compensation and 
impurity concentration are small. Also, 7’; does not follow in detail the 
prediction of (3). For example, in specimen (0) of fig. 9 (for which K ~ 0-4) 
7's should be 11°, lower than in specimen (a), whereas the observed slope is 
practically constant. This is one example of the limitations of the trapping 
model. We emphasize again that the model is valid only for very small K, 
when the acceptors are separated in general by a large number of donors. 

The theory of Miller and Abrahams (1960), outlined in § 8, is not limited 
to small K and thus cannot use the assumption of trap sites. Moreover it is 
essential in their work to remember that some sites are occupied and some 
not, so that the electron or ‘hole’ cannot move except where there is a 
site ready for it. These authors find that the resistivity p is given by 


p(Z') cexp (¢,/kT’) 


(2) 


or 


where 
€g3=C—1-35 €, 
and 
€4 = (6/1) (40.N'/3)¥8 
and ¢ is the Fermi energy. This is defined, if 
f= 1/[1 + exp {(ei — $)/kT}] 


and ¢; is the energy, due to the random field, of the donor site i, by 


Lh =Np-,, 
The summation is approximated by an integration using a density of 
states function p(e)de. This is obtained by assuming the energy spread 
to arise from nearest neighbour negatively charged acceptors. The 


Theory of Impurity Conduction Ths 


probability that an acceptor is at a distance r from a donor, and is the 


nearest one, is 
Bp2 3 
p(r)dr= =" exp \- (=) \ dr, 
TA TA 


where the mean acceptor separation is given by 


ry = (3/47N 4) 


Then «= e?/xr, so we have 


ple) de = (3e43/e4) exp {—(e,/e)* de, 
where 
eg=elery. 
On performing the integration of ‘ over ¢, we find 
1— K=exp{—(¢4/¢)9} {1+ exp (—¢/kT)} 1 


which determines ¢. Unless K is extremely small or very close to unity, 
(= —«,(n(l—K)}48 


As we shall see in § 4, the magnitude of the resulting activation energy ¢, 
agrees well with experimental values (cf. fig. 6). For KS 0-2, 


€3 = €p—1-35 yes 1:61 (e?/K)(N pt — 1-35N 44), . ° . (4) 


which is similar in form to that predicted by Price (1957), formula (1.1). 

Finally we may ask whether, in case (b), we may expect for holes a 
phenomenon similar to that suggested in case (a) where the overlap becomes 
large, namely a state of affairs when the states for the hole are not localized. 
It is clear that, if K <1, the lowest state must be localized with an energy 
given by (1), because a Coulomb field always leads to bound states. How- 
ever, when the carrier has escaped from the field of the nearest charged 
acceptor, then for a high concentration of centres a hopping process may 
no longer be an appropriate description of the motion. 

In the remainder of this section we shall suppose that K is i than 4 
The temperatures at which impurity conduction can be observed is ee 
determined by the following factors. The mobility of a carrier moving in 
the impurity levels is much smaller than in the conduction band, since the 
former is determined by interactions between widely spaced impurities. 
On the other hand, the number of carriers in the conduction band is 
determined by an activation energy «,(~10~-*ev for germanium), while 
the energy «, regulating impurity conduction is at least an order of magni- 
tude smaller, varying from zero to 10-%ev, depending on the concentration 
of centres. Thus at higher temperatures the conductivity is determined by 
carriers in the conduction band, at low temperatures by those in the 
impurity levels, and the transition between the two regions is quite sharp 
(fig. 4). In the whole region we may write formally 


o=N Cu, + new 
where 1,, 4, are the number of carriers and the drift mobility in the con- 
duction band and n, » the same quantities for the impurity carriers. 
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The Hall coefficient R, in the same notation, should depend on these 
quantities according to the expression 


R= (epeoftape + MH) [CC(Mefte + MH)”, 

where jy.) yy is the Hall mobility of the conduction band and impurity 
electrons respectively. Neglecting any temperature variation in the 
mobilities, this expression has a maximum when 

N Cf, = Nep. oho) lk oct pee aes ear 
The Hall curves for high concentration (samples 14, 15, fig. 5) can be 
qualitatively explained by assuming that a Hall effect exists for carriers in 
impurity levels, with Hall mobility comparable to the drift mobility. At 
high temperatures, for which n,,> np, so that the normal conduction band 
Hall effect is observed, 


1 I 
Rimp= = (48) =. oc oy Cas ee ee ee 


The number of carriers is practically independent of temperature in this 
range (n> Np—WN,), giving a temperature-independent Hall curve, as 
observed (fig. 5). The Hall coefficient Rex» in the exhaustion range 
(temperatures between 77° to 300°K) is due to the same number (Vp—NV,) 
of carriers, in the conduction band, but measured values of Rexn are about 
eight times larger than Rimp. This suggests that .,/ is anomalously low. 
This is at present unexplained. 

For the samples of low concentration it is not known whether a Hall 
effect exists at the impurity conduction temperatures. The drop in the 
Hall curves beyond the maximum can be attributed quantitatively to the 
rapidly decreasing contribution of the conduction band current to the 
total current. A flattening of the curves beyond the maximum (which 
would indicate a finite Hall mobility) is not observed down to the lowest 
temperatures at which it has been possible to make measurements (fig. 5). 
It is not clear on theoretical grounds whether a Hall effect is to be expected 
when the conduction process involves jumps of bound carriers to neighbour- 
ing sites. ‘The experimental evidence suggests that R may exist, but be too 
small to be measured in the low concentration range. This is shown by the 
concentration dependence of R/p (fig. 2), measured at 2°K for the p-type 
samples of table 1. We see that R/p (which from (6) is proportional to jg) 
becomes small as the concentration falls through the transition region. 
This is discussed further in part IT. 

When charge carriers are positive vacancies (in n-type material with 
both impurity concentration and degree of compensation small), it might 
be thought that ifa Hall effect exists it should be of opposite sign to that due 
to electrons in the conduction band. However, in a magnetic field H 
the vacancies do not behave as true particles with positive effective mass 
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and charge. For example, if a positive and a negative particle are initially 
drifting in opposite directions, a magnetic field will deflect both particles in 
the same direction. However, a charge vacancy always moves in the 
opposite direction to an electron, even in a magnetic field (fig. 3). Hence 
we expect no change in sign in the Hall effect when the conduction is by 
charge transfer}. . 


Fig. 2 
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The Hall mobility R/p for p-type germanium with 40° compensation plotted 
against concentration of majority carriers (sample particulars in table 1). 


Fig. 3 
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The motion in a crossed electric field H and magnetic field H of (a) an electron 
and a hole, (6) an electron and a charge vacancy (in a weakly compen- 
sated semiconductor). 
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+ Yonemitsu e¢ al. (1960) recently reported observing a change in sign of 
the Hall effect in a p-type germanium specimen containing 2x 101® gallium 
impurities per cm® and 40%, compensation. However, although the magnitudes 
of their resistivity and Hall measurements agree well with those of Fritzsche 
(figs. 4, 5), the latter author observed no sign change. 
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In the band theory of periodic lattices the concept of a ‘hole’ which 
behaves as a positive charge with positive effective mass depends in part on 
the negative effective mass of electrons occupying states near the band 
maximum. This latter is a consequence of the increasing importance of 
Bragg reflection of electrons as their energies approach the band maximum. 
It is unlikely that the phases of electrons scattered by a completely dis- 
ordered lattice will match sufficiently well for electrons of any energy to show 
a negative effective mass. Hence, in the metallic region of impurity 
conduction in n-type material, we would not expect to observe hole 
conduction or a positive Hall effect, for any concentration on impurity 
electrons. It is for this reason among others that we have avoided use of 
the misleading term ‘impurity-band’ when speaking of the metallic 
region. 


§ 3. Toe Impurtry WAvE FUNCTIONS 
We have assumed in the last section a hydrogen atom model for an 
isolated impurity centre. The departures from this model are briefly 
outlined here. It has been shown by Kohn and Luttinger (see Kohn 1957) 
that the donor electron wave functions in germanium and silicon have the 
form 


[=a Fir) 4,(0), os gee eee 


where ¢; is the Bloch wave function at one of the conduction band minimum 
denoted by j, and the sum is over the V equivalent minima (N = 4 for ger- 
manium, NV =6 for silicon). The «; are coefficients which are determined 
by the symmetry of the state. ;(r) is an envelope function satisfying a 
Schrédinger equation for the potential due to the impurity, but with the 
free electron mass replaced by the effective mass appropriate to the jth 
minimum : 
Te Oe i 0s ene e 
E Sin Oo ale = 53) = Sh Pile) = af). 


The z-axis lies along the direction of the k-vector of the jth minimum, and 
m, mt are the longitudinal and transverse masses respectively. For the 
ground state, /’; has the hydrogen-like form 


Sig 28\1]8 

P(r) = (7a?b)—? exp \- (“25 + iz) \ é wee fe 8) 
For germanium, a= 64:54, b=22-74. Corrections to this effective-mass 
approximation partially lift the N-fold degeneracy of the ground state. 
These corrections arise chiefly from departures from a simple Coulomb 
potential in the immediate neighbourhood of the impurity ion, and because 
the concept of a uniform dielectric constant breaks down in that region. 
In germanium, the ground state splits into a non-degenerate lower level and 
a 3-fold degenerate upper level (Price 1956); the coefficients of the wave 
functions (7) corresponding to these levels are: 
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Vea (L Ak, fl, <. 1) lower level ; 
o®—=4(1,-1, 1,-1) 
o)—=4(1, 1, -1, —1) upper level. 


o=4(1,-1,-1, 1) 


The splitting is 0-57 x 10-%ev for antimony impurities, but is of order ten 
times larger for arsenic and phosphorus donors (Fritzsche 1960 a); hence 
the-upper states can be neglected except in antimony. 

The acceptor states can similarly be constructed from Bloch orbitals 
at the valence band maximum. In this case the envelope functions F 
satisfy a set of six coupled effective-mass equations (Kohn and Luttinger 
1955, Kohn and Schechter 1959) ; the solution of these is difficult and hence 
only approximate solutions of the acceptor wave functions are known. 


§ 4. OBSERVATIONS OF IMPURITY CONDUCTION 


Impurity conduction has been observed in many semiconductors at low 
temperaturest. We do not attempt to give here an exhaustive review of 
experimental results but confine ourselves to the case of germanium, which 
has been studied most extensively. Results for other materials are 
qualitatively similar. 

Prior to 1960, measurements in germanium and silicon were made in 
samples grown from melts doped with suitable impurities. For a quantita- 
tive comparison of theory and experiment it is desirable to have a range of 
measurements on samples where either (a) the majority impurity concentra- 
tion is varied, but the degree of compensation K is kept constant, or (b) K is 
varied but the majority concentration is kept constant. It was practically 
impossible to achieve these conditions in the early measurements. Conse- 
quently, since the resistivity and Hall coefficient vary extremely rapidly 
with impurity concentration, comparison of theory and experiment was 
uncertain. Recently, however, Fritzsche and Cuevas (1960a) has pub- 
lished measurements on p-type germanium samples in which the acceptor 
concentration (gallium) ranges between 8x 10'4cem~? to 1:3 x 1048 cm? 
and the compensation ratio (arsenic and selenium) is kept fixed at K = 0-4. 
The impurities were introduced into pure germanium by slow neutron 
bombardment (Cleland et al. 1950), causing transmutation of germanium , 
atoms. The proportion of different impurities produced is determined 
by the cross sections for neutron capture and the decay schemes of the 
various germanium isotopes ; therefore the compensation ratio is constant. 
The magnitude of the impurity concentration can be varied by the neutron 
flux and exposure times of different samples. Figures 4 and 5 show results 
of measurements of the resistivity p and Hall coefficient R; table 1 gives 


information about the specimens. 


Ce a Steet A eee CE a ae ees 

+ SiC: Busch and Labhart (1946); Ge: Hung and Gliessman (1950, 1954); 
CdS: Kroger et al. (1954); Si: Morin and Maita (1954), Carlson (1955); p-InSb: 
Fritzsche and Lark-Horovitz (1955); n-InSb: Sladek (1958); Te: Fukuroi 


et al. (1954). 
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Fig. 4 
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Resistivity as a function of temperature of p-type germanium with compen- 
sation K=0-4; particulars in table 1 (Fritzsche and Cuevas 1960 a). 
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Fig. 5 
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Hall constants of p-type germanium as functions of temperature (particulars in 
table 1). 
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Table 1 
Sample Na (cm) p(T =2-5°K) Q em p(em?/volt sec) © 

1 7-5 x 1014 8-9 x 10° 4-3 x 10-5 
2 1-4 x 1045 1-8 x 107 12x10 
3 1-5 x 1045 1:0 x 10’ 2:3x 10-3 
4 2-66 x 1015 5-6 x 105 3-0 x 10-2 
5 3:6 x 1015 1:0 x 10° 0-11 
6 4-9 x 1015 2-5 x 104 0-53 
7 7-2 x 1045 6-3 x 108 2-4 
8 9-0 x 1015 2-0 x 108 
9 1-4 x 1016 4-5 x 10? 

10 2-4 x 1016 1-0 x 10? 16 

ll 3°5 x 1016 28 4:0 

12 “Seto OLs 20 

13 1-0 x 101? 0-50 

14 1-5 x 101? 0-18 180 

15 5-0 x 101? 3:2 x 10-? 250 

16 1-35 x 1018 1:8 x 10-2 


Estimated mobilities 1 are shown in column 4 of table 1. 
For specimens (1—7) with low impurity content, j. is obtained from values 
of the resistivity at the temperature of the Hall maximum by making use of 


eqn. (5): =p,n,[n=(N,Kep,)—. 


Fig. 6 
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The activation energy ¢; of impurity conduction for the samples in table 1. 
Anon curve represents the calculation of Miller and Abrahams 
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We have assumed the number of impurity carriers n to be NV p(=KN,), 
since K islessthan4. At higher concentrations, we use simply 


u=E/p. 
This is really the Hall mobility, which as we have seen in § 2 may be appreci- 
ably smaller than the drift mobility. 

Values of the activation energy «, in the low temperature region are 
plotted in fig. 6, against the average acceptor separation. The magnitude 
of «, agrees well with the values calculated from the theory of Miller and 
Abrahams (1960), shown by the dotted line, in the region of low con- 
centration. 

Measurements of the conductivities 1/p of n-type samples can be fitted 
by a sum of three exponentials (see, for example, Fritzsche 1958), 


1/p=c, exp (— e,/kT’) +c, exp (— «,/kT') +c, exp (—,/kT). 


Here ¢, is the activation energy for exciting an electron into the conduction 
band, and e, that for impurity conduction. The role of ¢,, which occurs 
only for samples in the transition region (2 x 10!6<N <8 x 10!%), is not 
clear. ¢, is observed in weakly compensated n-type and p-type samples, 
not however in p-type samples having K=0-4. A fourth activation 
energy was observed at temperatures below 1°K (Zaravstikaya 1956), in 
low concentration specimens. This however has since been shown to be 
due to stray light quanta exciting electrons into the conduction band. The 
resistivity is so large at these temperatures (of order 101°Q.cm) that a 
very small fraction of excited electrons can lower the resistance appreciably. 

Some measurements of the resistivity have recently been reported on 
germanium samples of constant acceptor concentration (2-66 x 10 cm-*) in 
which the degree of compensation was varied from 0-4 to 0-9 (Fritzsche 
and Cuevas 1960b). These are shown in fig. 7. 

The samples were produced by bombarding specimens of n-type ger- 
manium with slow neutrons. The same neutron flux and exposure time 
was used, so that the same acceptor (gallium) concentration was produced 
in each specimen. The degree of compensation however depends on the 
initial donor concentration. Figure 8 shows the observed activation 
energy as a function of K. The solid line is that predicted by Miller and 
Abrahams (1960) and is in good agreement, except at very high values of K. 

The variation of the resistivity with the degree of compensaton should 
depend on whether the specimen shows metallic or non-metallic conduction. 
Measurements (in addition to those above) in which K is varied but the 
majority concentration is kept constant, have been made by Fritzsche 
(1955), Fritzsche and Lark-Horovitz (1959), in n- and p-type germanium, 
and by Ray and Longo (1959) in n- and p-type silicon. The results can be 
summarized as follows. . 

(i) In specimens showing metallic conductivity when K ~ 0, both the 
resistivity and Hall coefficient increase with K, the activation energy ¢€3 
remaining zero provided K is not too large. In p-type germanium samples 
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Fig. 7 
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The resistivity of p-type germanium with constant acceptor concentration Na 
and compensation K varying from 0-4 to 0-9. 
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with N,=2-5x10!cm-, Fritzsche and Lark-Horovitz (1959) observed 
that the conductivity becomes non-metallic as K increased between 0:4 
and 0-7, the Hall and resistivity curves showing behaviour types of the 
transition region (figs. 9, 10). 

(ii) In specimens showing non-metallic conductivity when weakly 
compensated, the resistivity at a constant temperature decreases to a 
minimum as K is increased to about 0:4, and thereafter increases. 

Writing 

p=poexp (—«,/kT), 
po is found to increase with K (fig. 11). However, in samples with 
Np=10'%cm-3, py) remains approximately constant in the range 
103 < K <10~ (Fritzsche 1960, private communication); these samples 
have anon-vanishing Hall coefficient and are in the transition range. 

These results can be understood in part by considering the effect of 
compensation on the carrier concentration. In the metallic range the 
number n of carriersis Vmaj(1—K). Inthenon-metallicrange n=N maj K 
when K <4 (the carriers are vacancies on impurity sites) and n= N maj; 
(1—K) when K>4. Hence, since 


p= 1/neu, 


P.M.S. 
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Resistivity of p-type germanium with variable degree of compensation 
(Fritzsche and Lark-Horovitz 1959). 
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Fig. 10 
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The Hall constant of the samples of fig. 9. 


if the conduction is metallic p should always increase with K, assuming 
the carrier mobility , to depend only weakly on K. If the conduction is 
non-metallic, p should decrease as K increases to $, and increase as K goes 
from 4 to 1. 

The dependence of p on K in the non-metallic range has not however 
been adequately explained. Since the term exp (¢,/k7') dominates p at low 
temperatures, and ¢, is observed to have a minimum value for K ~0-4 


LZ 
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(rather than K =0-5), a minimum is also observed in p at K=0-4. Al- 
though Miller and Abrahams (1960) have been able to calculate the observed 
dependence of e, on K (fig. 8), in their theory the pre-exponential factor po 
is independent of K. This independence is observed only for K < 10~, and 
then for specimens in the transition range, to which their theory does not 
apply. On the basis of the trapping model (§ 2), po(K) is again predicted 
incorrectly. For in this model the number of free carriers is proportional to 
K*2(eqn.(2)). When K is very small we can assume that the mobility 
of {free carriers between sites at large distances from compensating 
centres is independent of K. Hence py should be proportional to K—?, 
whereas py, is observed to increase with K (fig. 11). 


Fig. 11 


O K 1 


Variation of the pre-exponential factor po (in the equation p= p, exp (— ¢;/k7’)) 
with compensation K; (a) experimental variation, (6) calculated by 
Miller and Abrahams (1960) and (c) calculated on the trap model. 


We have seen that the magnitude of the impurity conduction depends 
sensitively on the impurity concentration and degree of compensation. <A 
further variable is the type of impurity. For example, when corrections 
to the effective-mass formalism (§ 3) are taken into account, it is found that a 
donor electron is localized in a smaller volume around an arsenic impurity 
atom than around antimony. Hence the resistivity is larger for arsenic 
than for antimony impurities, if the same number of both are present, due 
to the smaller overlap of the arsenic wave functions with neighbouring 
states. 

The Bohr radius of the impurity wave functions in silicon is of order } 
that in germanium. Hence the impurity conductivity in silicon is always 
very much smaller than in germanium, for comparable impurity concentra- 
tions. Also, the energy e, required to activate electrons into the conduction 
band is of order 5 x 10-* ev in silicon and 1 x 10-evin germanium. There- 
fore the onset of impurity conduction is observed at higher temperatures 
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in silicon than in germanium (again for comparable impurity concentra- 
tions) due the faster freeze-out of electrons from the conduction band of 
silicon. 

The impurity concentration at which the transition from non-metallic 
to metallic conduction is observed depends on the overlap of neighbouring 
donor states, and on the degree of compensation. Because the impurity 
states are more localized in silicon than in germanium, the transition is 
observed at higher impurity concentrations in silicon. Similarly, because 
the impurity Bohr radius is different for different impurities, the transition 
concentration is also a function of the type of impurity. Fritzsche 
and Cuevas (1960 b) finds that the activation energy ¢, in p-type (gallium 
doped) germanium disappears at N, = 1-09 x 10!7cm-3 when K ~ 0-04, and 
at N,=1-80x10!7em-3 when K=0:4. When N,=2-5~x10!?cm-3, the 
activation energy disappears at a value of K in the range 0-4 to 0-7 
(Fritzsche and Lark-Horovitz 1959). These results are discussed in more 
detail in § 11. 

It is found that there is a large change in impurity resistivity when the 
overlap of neighbouring impurity states is altered by straining the crystal. 
Wesaw in§ 3 that the wave function ¢% of an isolated donor can be constructed 
from Bloch orbitals taken from the degenerate conduction band minima. 
Although the envelope functions F (eqn. (8)) are anisotropic, in an un- 
strained crystal % has the tetragonal symmetry of the lattice, and the 
overlap of neighbouring donor states leads to an isotopic resistivity. Ina 
germanium crystal strained along, for example, the [110] direction, two 
conduction band minima are depressed in energy relative to the other two. 
If the strain is large enough, the donor electron ground state will be a sum 
of the F’; ¢; corresponding to the depressed minima only, 


by = 2-1? (FO, + F; ;). 


The overlap of such functions will vary strongly with the direction to the 
neighbouring donor relative to the strain direction. Hence we expect that 
-in a strained single crystal the impurity resistivity will be anisotropic, 
with a maximum value in the direction of least overlap. Fritzsche (1960 b) 
has measured the change in impurity conduction in germanium containing 
5-2 x 10 antimony atoms per unit volume at 1-9°K as a function of uniaxial 
tension and compression along the [110] direction. Figure 12 shows the 
resistivity ratios of the strained to the unstrained sample for the three 
principle directions of the resistivity tensor (labelled C, D and E). A 
‘saturation’ is observed when strains are large enough for only the depressed 
minima to contribute to the donor wavefunction. The relative magnitudes 
of po, pp and px are in agreement with predictions on the basis of the overlap 
of these functions. 

Additional evidence that there is a conduction process in the impurity 
energy levels at low temperatures is provided by the work of Sladek (1956, 
1958, 1959) on magnetically induced impurity banding in n-type indium 
antimonide. Because of the small effective mass ratio (m*/m=0-013) and 
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large dielectric constant («=16), the Bohr radius of a donor electron in 
indium antimonide is large, about 1444, and the ionisation energy of an 
isolated donor is small (0-:0067ev). For all obtainable purities the overlap 
of donor wave funtions is large and the levels are broadened and merge'with 
the bottom of the conduction band. A strong magnetic field will shrink 
the donor wave funtions (Yafet et al. 1956), producing two effects. 


Fig. 12 
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Ratio of the resistivity of stressed and unstressed n-type germanium, for the 
principle directions, as a function of stress (Fritzsche 1960 b). 


(i) The donor ionization energy is increased due to the decrease in the 
Coulomb energy of the electron. Thus for a large enough magnetic field 
donor levels will be split off from the bottom of the conduction band. 

(ii) The overlap of the donor wave functions is decreased. This effect 
is large because of the small effective mass. Figure 13 shows measurements 
of the Hall coefficient R as a function of temperature and magnetic field 
strength, in a specimen containing 5-3 x10! donors per unit volume 
(Sladek 1958). For asmall magnetic field R is approximately temperature- 
independent since the specimen is degenerate. At fields greater than about 
10k-gauss we see the onset of typical impurity conduction. R reaches a 
maximum value at around 20°K due to a competing conduction process in 
the split-off donor levels. The depression of the donor levels below the 
conduction band increases with the magnetic field B, as shown by the 
increasing slope of the Hall curve with B at temperatures above the Hall 
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Fig. 13 
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The Hall constant as a function of magnetic field in InSb containing 5-3 x 10™ 
donors/em? (Sladek 1958). 
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The effect of a magnetic field B transverse to the current direction on the 
mobility of impurity electrons in n-type InSb; a, is the Bohr radius of a 
donor wave function in a direction perpendicular to B (Sladek 1958). 
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maximum. The Hall mobility 1, for the same specimen measured at a 
temperature below the Hall peak, is plotted in fig. 14 against (a9/a,)?. The 
Bohr radius ay is the zero field value, while a, is the radius normal to the 
field H, calculated from the theory of Yafet et al. (1956). Thus as the 
magnetic field shrinks the donor electron’s orbit and hence lowers the 
overlap of neighbouring donor states, the electron’s mobility decreases, as 
is to be expected on the impurity conduction model. 


§ 5. Mernops or CaLcuLatING THE ELEcTRicaL Conpuctivity aT Low 
CONCENTRATIONS 


By low concentrations we mean here concentrations such that the electron 
gas is not ‘metallic’, so that the conductivity tends to zero with the 
temperature. In this range of concentration we have to do essentially with 
a ‘one-body’ problem, the movement of a single electron under cireum- 
stances in which the interaction between electrons is not important (except 
in so far as an occupied centre blocks the passage of an electron from another 
centre). However, even so, the problem is complicated, and far from a 
complete solution. 

If we take as our first problem the movement of an electron in a disordered 
lattice in which the ions are assumed to be at rest, we have to ask whether 
the states are localized or unbounded in space. By disordered, we mean 
either in random positions or acted on by a random field (such as that from 
the charged minority centres) or both. By a ‘localized’ state we mean 
that each characteristic solution of the Schrodinger equation for an electron 
in this field decays exponentially to zero at sufficiently large distances from 
some point in space. The problem of the conditions under which states are 
localized or not is by no means solved. In §7 we reach the conclusion that 
all states in a one-dimensional lattice may be localized. This may corres- 
pond to the theorem that in one dimension any potential hole, however 
small, leads to a bound state. In three dimensions the work of Anderson 
(1958) was the first to show that, if the separation between impurity centres 
is large enough, all states would be localized, but at some higher concentra- 
tion one would go over to unbound states}. Anderson was concerned with 
spin diffusion; Twose (1959) extended his work to impurity conduction, 
finding as the criterion for bound states 


Ny< 10-4/a,2 ode eed tes SEB felt cre Maes (9) 


where a, is the hydrogen radius of the centres (h?«/m*e2). Due to the 
approximations that it was necessary to make in this work, the result 
may be too smallf. 

Suppose now that (9) is not satisfied, so that an electron has a wave 
function extending through the lattice. We shall now consider the 
SS ee nan en ees oe Dee ie EE 


{ This concentration has nothing to do with the concentration for metallic 
conduction discussed in part IT. 


{ See Appendix for detailed discussion. 
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problem of its mobility, under conditions when the interaction with other 
electrons is not important (i.e. 1—_K <1, or for a ‘metallic’ electron gas). 

We shall have to calculate the time of relaxation 7 and mean free path l. 
The question arises as to whether the usual transport theory can be used, 
developed as it is for the case where the scattering processes are weak 
compared with those interactions between lattice atoms which broaden 
the sharp electronic levels of the isolated atom into a band in the solid. 
If vs is an average relaxation time for a carrier scattered between states 
of different energy and momentum in the band, a suitable criterion that 
the band approach be a good one is (Joffe 1956, Herring 1959, Yamashita. 
and Kurosawa 1960) 

t3xh|W, 
otherwise the uncertainty in energy of the scattered carrier would exceed 
the bandwidth. This leads to a lower limit on the carrier mobility p below 
which the band model will not be an adequate starting point for a con- 
ductivity calculation. For 
nets. en ea? (10) 

where m* is the effective mass of the carrier and a the lattice spacing. 
Here we have used the Bloch tight-binding model to estimate m*W ; this 
should be a valid approximation in those substances for which the band 
approach is questionable, for example, the narrow 3d ‘band’ in transition 
metal oxides. With a~4A, the mobility must be larger than of order 


[y= 0-5 em?/volt sec. Ree pee ee ie, 


Similarly, the carrier mean free path 1~7s6 (where @ is the mean velocity) 
must be larger than the order of the lattice spacing : 


o~ah|m*a, 
(12) 


Tht = mh? a TO 

m*a~ am*W 6” 
The numerical factors are of course approximate. It is possible to define 
the wave vector k of a Bloch electron only to within 1//, on a similar argu- 
ment. 

In metals, these conditions are usually easily satisfied. Electrons in 
copper, for example, have a mean free path of order 140 lattice spacings 
and mobilities 30 cm2/volt sec at room temperature. In a large number of 
semiconductors however, the carrier mobility may be of order jz; or lower. 
Taking the transition metal oxides as examples, in Lyi Nip, O carriers in 
the 3d (Ni) levels have mobilities ranging from 10-% to 7 x 10- cm?/volt sec 
from room temperature to 1000°K (Morin 1958). In titanium oxide 
(TiO) Morin estimates the ‘d band’ mobility as 0-4cm?/volt sec at room 
temperature. In the case of impurity conduction in germanium or silicon, 
the condition (10) for the mobility becomes 


e 3 \ 28 
ep jalan [dee tas =1014N-22 em2/voltem. . . . (13) 
PS Gh a (18) 
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Here we have assumed that the majority impurities, of concentration N, 
lie on a lattice of average spacing (3/47N)!3. Table 2 compares experi- 
mental estimates of the mobility, exp, with p, for n-type germanium. 
Thus a treatment of impurity conduction which assumes that the impurity 
electrons can be described by modified Bloch-type functions would appear 
to break down as NV decreases into the transition region. 


Table 2 


N (cm-*) 1017 


fz, (em?/volt sec) 5x10? 
Hexp (em?/volt sec) 4:10? 


Methods based on the density matrix have been developed to handle the 
conductivity of solids under these conditions; we have thought it worth 
while to include in the next section a simplified treatment, limited to one 
dimension, to show what these methods mean. 

We turn now to the case when the states are localized. Then in general a 
finite amount of energy is necessary to move an electron from one localized 
state to the next. This can only come from phonons. We have thus to 
consider the interaction with phonons. 

Phonons can act in two ways, which can be distinguished in the language 
of field theory as strong and weak interactions. In polar lattices it has 
been known for a long time that ‘self-trapping’ is a possibility (Landau 
1933, Mott and Gurney 1940, p. 86). The electron or positive carrier 
polarises the lattice round it and can only move by carrying this polarization 
with it. A jump from one site to another is thus a multiphonon transition. 
This is believed to occur in nickel oxide, the carrier being a positive vacancy 
on a Ni*+ ion. 

Single phonon transitions are probably of predominating importance in 
valence semiconductors, and these are discussed in§7. We start here with 
the concept of a localized state; an electron is localized in a given centre 
and, receiving energy from a phonon, it makes a transition by tunnel effect 
to another centre where the energy is different. This is common to the 
treatment of Twose and of Miller and Abrahams: the latter are interested 
in the case where K is not small and so all centres are not available for any 
one moving electron or hole. 


§ 6. THE PROPERTIES OF A ONE-DIMENSIONAL DISORDERED 
LATTICE 
In this section we investigate some of the properties of a one-dimensional 
lattice, not with a view to applying them to the actual problem, but as an 
illustration of some of the principles involved. 
We shall first (in § 6.1) investigate the mobility of an electron in a one- 
dimensional lattice, assuming that the wave functions are extended 
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throughout the lattice. The purpose of the investigation is to give a 
method appropriate to the case when the mean free path is comparable with 
the electron’s wavelength, so that the usual transport theory based on 
Boltzmann’s equation is not applicable. The calculation for three dimen- 
sions has been given by Edwards (1958), but that in one dimension is so 
much simpler that it is worth reproducing. 

In § 6.2 we shall examine the nature of the wave functions in a one-dimen- 
sional disordered lattice. There have been many investigations of the. 
density of states in this case (for refs see Frisch and Lloyd 1960) but as far 
as we know none of the nature of the states. We shall show that in many 
cases (perhaps in all) the states are bound. 


6.1. The Conductivity of Electrons in a One-dimensional Disordered 
Lattice 

Thee nehont this ooo we discuss the conductivity of carriers on a 
one-dimensional ‘wire’ of length LZ, on which they have a non-periodic 
potential energy V(x). We have to develop a transport theory appropriate 
to this case. Before doing this we shall set down the usual transport 
theory based on Boltzmann’s equation in a form appropriate to a one- 
dimensional model. 

In this case, where the state of the carrier can be defined by a wave 
number & and a time of relaxation 7 can be defined, the current j is given by 


jae] "(bf (@yudk es aa TES 


where N(k)dk is the number of states in the range k to k+dk, f(k) the 
probability that a state k is occupied and wu the velocity (u=dH/hdk). Ifa 
field F is applied, then in a steady state 

f=for (Gfldk)ePr[h, 
where f, is the form of f in the absence of a field. Thus from (14) we obtain 


2 
j= = [wos Dur dk. ey nN iepleds (als) 
We may write wr=/, where / is the mean oo path, and 
N(k)=L, 
so that : : 
erL (,d 
p= — dH. tenes Eee aera CLO 
jas |"am4 (16) 


This formula gives the total current in the wire. The current C' at any 
point is obtained by dividing by L, so that 


om fia. Mirman ers cha 
If we wish to derive a mobility from these formulae, we SB suppose a 


Boltzmann distribution of electrons and set 
f=const. exp (—/kT), 
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so that, from (16) 


OF oan ee 
_2F (ny oF ape ut .dk, 
j=5 [x0 Fala eR [N(a) flul | 


from which we see that the mobility p is given by 
p=elul/kT ~ ellm*u~er/m*, 


where m* is the effective mass. 

We shall now derive by a simplified method a formula due to Greenwood 
(1958) for the current O, valid even when a mean free path cannot be defined. 
We assume as before that the electron moves on a circular wire of length L 
on which its potential energy V(x) is that of some random (non-periodic) 
field. Then the Schrédinger equation of such an electron is 

ds 2m 


The solutions are subject to the boundary conditions 


(2) ak f(a +L), f(x) ce ib (x +L), 
so that they are single-valued on the circular wire; they are real and in 
general non-degenerate. We denote the stationary wave functions and 
energy states by %,, #,. Asin Greenwood’s paper we imagine an electro- 
motive force induced in the circuit by a magnetic field, threading the circuit 
and increasing uniformly with the time. The perturbing term in a time- 


dependent Hamiltonian is 
(heF'/mi) t d/dx, 


where F is the induced electric field and ¢ the time. If then an electron is 
initially in the state n, at a time ¢t after the perturbation is switched on, 
the wave function ’ is given by 


Y= Ln exp ( a 1H, t/h) Ae > 4nn(t) P(X) exp ( =y iL, t/h), 


where 
t 
(h/t) Onn (t) = (heF /mi)D,» | texp (iw,,t) dt. 
Here ; 
L a) 
UB Pa, ee i: By Ox Yn dx, 
and 


nn (£,, eo E,,)[h : 
Integration by parts and neglect of oscillating terms as in Greenwood’s 
paper gives 
An = (€F/m) Dyn {1 — exp (iwynt)} orn’ 
The current due to an electron initially in state n is 


eh 0 ) : 
Imi fev Fed Ey Ox eT) du = (e°h.F'/m*) a3 | Del sin Wnn't/w* nn’. 
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If f,, is the occupation number of each state, the resultant current may be 
written 


aa dee nv sin{(H, —H,: )elRs 


If N(£) is the ete of states, ok. for ee mite of the time ¢t this 
becomes 


(18) 


me*h mr | Dw se |? —— oe ny} aLy, Gh teelpr ts (19) 


where the bar gives an average over is of n’ near ton. This formula 
is valid in general; it will be noted that m is the electronic mass, not the 
effective mass. Also to obtain the current C at any point we must divide 
by L. 

Formula (19) is the basic expression for conductivity in one dimension. 
With any such formula, our first task is to show that for small perturbing 
energy V it leads to the formula given by the Boltzmann equation. This 
has been done, in the general three-dimensional case, by Edwards (1958). 
His proof uses advanced methods, and we shall now give a simple discussion 
to show how the Boltzmann expression (17) arises. 

For any form of V(x) we can define a wave number k such that 1/k is the 
mean distance between zeros of %,. Then as before 


N(£)=L/(dE/dk). 
If the disordered field is small perturbation on a periodic field in which the 
electron has an effective mass m* 

N(E) =LIm* [hk CA ens oc eT LO) 

We shall now estimate |D,,,, |? for this case. First we note that, while 

the diagonal element D,,,, vanishes, the off-diagonal element D,,,, is not 
small even if n and n’ differ only by unity, since the phase of %,, may differ 
from that of ,,,by alargeamount. Thus the integral 


| thn (Oiby/Oxe) de, 


integrated over one mean free path J, a distance in which the wave functions 
are coherent, will be of order 

WAU OES ieee cl eer een CAD, 
the quantity L in the denominator comes from the normalizing factor of the 
functions %,,. There are L/J mean free paths in the length in the wire, 
and the signs of the contribution of each to D will be random; thus, to 
obtain the root mean square of D, (21) should be multiplied by (L/1)"#, and 


| Drm | ~ (1 L)¥2 mke|m*. PRA MOD) 
Substituting for | D,,,. |? and for {N(£)}? in eqn. (19) and dividing as before 
by L to get the curent at any point, we find 


c= [ate 


apart from numerical factors, which is identical with (17). 
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It is of interest to show directly that the mean free path as usually 
defined can be equated (apart from a numerical constant) with the distance 
in which any given phase relationship between two functions, $,, %,- of 
very nearly the same energy is lost. For one dimension we shall treat 
the problem by considering a row of scattering centres at points 
4, Ly, ... , each reflecting a small proportion of any incident wave. Itis 
convenient to use at each of these centres the delta-function used by Lax 
and Phillips (1958) so that at each centre ¢ is continuous and 7’ changes by 


Ais, where 
Af’ /p= 
and 7 is small. 

Suppose that such a centre is situated at x = 0 and we ask for the amplitude 
of the wave reflected from it. We set for an incident and reflected wave 
(x<0) 
= exp (ikx) + A exp (—tka) 
and for a transmitted wave (x > 0) 

= Bexp (ikz). 
The boundary conditions give 
1+A=B, 1-A=B+q 
where g=y/k. Hence 
A= 44. 
Thus the amplitude reflected by the first NV of such centres is 


N 
$¢q > exp (2ikay). 
1 


An estimate of the number NV of centres in one mean free path would be_ 
obtained by equating this to unity. 

Now consider the phase of a real function % and let us ask how much 
it changes at a scattering centre. Let us write for x<0 

=A, cos (kav + ¢,) 
and for 7 >0 

=A, cos (ka + €,). 
The boundary conditions give 

A,cos{,=A,cosé,, tanf,—tanl,=y/k=q, 
and hence, since q¢ is small 
AC=¢,—¢,=q cos ¢, cos ¢,. 

Since ¢, and ¢, are nearly equal, this may be written 

AC = 49 (cos 2¢—1). 

The second term (—1) in the bracket obviously makes no contribution 
to the rate at which two waves get out of phase with each other. The 
total shift in the phase of two waves initially 47 out of phase with each other 
will be the difference between }$q¢ cos 2kx,, and 4qsin 2kx,, which will be of 
order unity in one mean free path as defined above. Either method gives 
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for the mean free path a quantity of order 

I~ b? q?/(Ab)?, 
where 6 is the mean distance between centres, Ab the root mean square 
fluctuation and g=7/k as before. 

A particularly interesting application of these ideas is to the ‘tight 
binding’ case, which of course is applicable to impurity-band conduction. 
The overlap integral between two hydrogen-like centres, of Bohr radius a, 
distance 6 apart, is 

j=2W,(1+6/a) exp (—b/a). 
Elementary considerations based on a deformation potential suggest that, 
for a mean displacement Ad from positions in a periodic lattice, the mean 
free path will be of order 


Aj\2 .. 8j 
eh z= AD, 
Gay rok 


The mean free path / will be of order 6 when Aj ~j and if b > a this will occur 
when Ab~a, and thus for Ab/b<1. It is possible—and the experimental 
evidence reviewed in § 12 suggests that this is the case—that / may become 
less than b before bound states occur—that is to say, in the three-dimensional 
case. If so, we shall want to know what happens to formula (22) in this 
case. No detailed calculations have been given, but we should expect a 
random fluctuation of the amplitude of the wave function on each centre by 
exp(—Ab/a). Since the normalization of the wave function will be deter- 
mined by the larger amplitudes, typical terms in the integral for D,,,,, will 
be reduced by just this factor. We thus expect the mean free path / to be 
of order 1~bexp (—2Ab/a). 


6.2. Bound States in the One-dimensional Model 

The above analysis assumes that the wave functions extend through the 
lattice (the states are not bound). Actually in one dimension this may 
not be the case, but the analysis is of some interest on account of its 
possible extension to three dimensions. | 

The simplest model for which this may be seen is that in which we take 
for V a potential of Kronig—Penney type, as shown in fig. 15, namely a 
series of potential barriers, a mean distance 6 apart, but such that the root 
mean square of bis Ab. Wesupposed that Ad is greater than or comparable 
with the wave length of sin the range AB. We suppose that the two solu- 
tions, which increase or decrease exponentially in the range BC, increase 
or decrease by p or 1/p, and that p is large. 

Consider an oscillating solution in the range AB of form y= cos (kx + ¢). 
Then for all phases £(—7<¢<7), except those in a range of order 1/p, 
will increase in the range BC. If has increased in the range BC, then, 
whatever the phase in AB, % will in general increase again by the factor p 
in DE. Thus the solution with arbitrary phase in AB increases exponenti- 
ally as x increases, and by the same argument will increase exponentially as 
x diminishes. Such a solution is shown in fig. 15 and obviously has no 
physical significance. 
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As we have seen, for all phases ¢ in AB except those in a range AC ~ 1/p, 
zs increases in BC; and for these solutions ¢ varies only by 1/p in CD, by 
1/p2in EF andsoon. ‘Thus our typical solution, obtained by starting with 
arbitrary energy and phase in an interval such as AB, will increase ex- 
ponentially in both directions, the phase becoming more closely defined as 
the distance from AB increases. It is of course possible to choose the phase 
in AB so as to give an exponential solution which increases or decreases 
over any relevant range of x. 

The stationary solution may be found as follows. Take any gap in the 
potential energy, for instance CD. For any energy W, one can set up 
solutions from both sides which increase as one comes towards CD. In 
general these solutions will not fit in the middle (fig. 16), but one can 
choose values of the energy such that they do. These are the quantized 
energy values of the problem. 


Fig. 15 


Wave function in a one-dimensional disordered lattice. 
Fig. 16 


Wave functions for a forbidden value of the energy. 
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Our conclusion that the stationary solutions of the problem correspond 
to bound states thus holds unless there is an accidental degeneracy. 
Although for a one-dimensional space of length L the energy levels are 
spaced very closely, such that 

AW ~ WO/L, 
two localized wave functions distant in space na apart must have the 
same energy to any accuracy of order W (1/p)” to give a resonance, and this 
will be unlikely. 

We have assumed that kAD~1. If kAb is small and the energy is chosen 
in the allowed band-width, it is clear that the average increase of ~ in 
going through each barrier is not 


p=exp [b{2m(H — E)/h?}12], 
where H is the height of a barrier, but of order 
exp [Ab{2m(H — E)/h?}]. 


For a periodic field (Ab =0).the states are of course not localized. We have 
not been able to extend the proof to a more general case. 


§7. Toe INTERACTION oF LocaLIzED CARRIERS WITH LaTTICE 
VIBRATIONS 

In this section we shall be dealing throughout with a three-dimensional 
lattice. Our problem is to discuss the movement of an electron from centre 
to centre under conditions in which the states are localized, so that lattice 
vibrations are essential in making the transitions possible. We have to 
distinguish between strong coupling, appropriate to ionic lattices, and 
weak coupling, appropriate to silicon and germanium. In the former case, 
we have to show how the concept of a polaron arises. 

In the strong coupling case we may consider the compound Li, Ni,_,0O. 
Here a small fraction of Ni?* ions have been replaced by Li* ions, with the 
formation of Ni** ions to preserve electrical neutrality. Hach of these 
can be regarded as a positive hole bound to a Ni?* site. We discuss the 
- motion of this hole through the lattice of Ni?+ ions; the potential that 
hole experiences is not perfectly periodic, due to the random arrangement 
of Lit ions. This problem has been treated by Yamashita and Kurosawa 
(1958, 1960) and Sewell (1958). Since the carrier-lattice coupling is 
strong it cannot be treated as a perturbation, as is done in the usual theory 
of conduction by Bloch electrons. Ina more exact treatment of the lattice 
coupling, we find that the hole bound to a given Ni?* ion moves in the poten- 
tial of that ion and of the polarization field produced by the displacement of 
the surrounding ions from their equilibrium positions. ‘Thermal vibrations 
of the lattice ions are about the displaced positions. Thus the bound hole 
can be represented by the wave function 


¢;(r—R,) TTX, (@,(Ri)}, ace coe oJ CEY) 


P.M.S. be 
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which includes explicitly the interaction between lattice and hole. Here 
R, is the position of the ith nickel ion, r the coordinate of the hole, Q, a 
phonon coordinate and n, the number of phonons in the mode o. _ The 
Hamiltonian of the hole can be written 


H =H y+ H itt H,, 
where 


H => (P2+029, 2), 


so that #,, describes the lattice vibrations and Q, is the uigeanoe of 
the nde o. ‘The term 


Fimptlim + SU" -R)+ AT), 


describes the hole, with momentum p, moving in the field of the potential 
energy U(r—R,;) due to nickel ions at points R;. The sum is over all lattice 
sitesj7. AU describes the corrections to the periodic potential due to other 
positive holes and Lit ions. The lattice-hole interaction is of the form 


SH mnt = "> Clexp (iv F105 pate een Senne 
where t, is the wave vector of mode a, and C,, includes the coupling constant. 
A variational technique (see, for example, Frohlich 1954) is used to find 
an eigenfunction in the form 4; (r—R,)X,,(Q) to the Hamiltonian 

. HY=HO+ Himt+ Hy, 
where 

H = p?/2m + U(r —R,)+ AU (r). 
Then ¢; is required to be a solution of 


{p?/2m + U(r—R) + AU (r) + (X* | Hime | X)}o(r—R) =e: 4; (FR). (25) 
The term (X*|# |X) describes the polarization field of the surrounding 
ions. We see that ¢; is predominantly just the wave function of a hole 
localised in the potential U(r—R,). Similarly, X,,(Q®) must satisfy 


{>(P.? +0? Q,?) + ($;*(r) | H int | di(r) X= gg. Ss 2. (26) 
From (24), 
<bi* | Hint | $i) = VC, (Hi* |exp (i700) 1$:9Q,=D4Q,. . (27) 
By a change of coordinates, namely ; 


Q,=9,9-Co, CH=AP]o?, wi Ssh SB) 
(26) becomes 
Ld (P24+0,2Q,00?) Xp = en’ Xp. 
Thus X,, is a product of single mode oscillator functions, 
X= TTX, (2.91, 
each having the displaced pa a Q, ®, The energy corresponding to 
the state $,X,, is «;,, where 


Emn= G+ en =Gt dY(n,+ $)hw,— > {A |?/2e,. = ted 420) 
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We can now treat the potential (4% — #) due to the other lattice sites 
as the perturbation which causes the hole to move. This is analogous to the 
usual Bloch tight-binding approach. In the Bloch case, however, one 
starts with a product of an atomic orbital »b(r—R,) and phonon functions 
X,(Q), X,, being independent of the site R,. A hole (or electron) with wave 
vector k is the described by 


br > eaD (2kR;) yb (r — R,)} X,, (9) 
and has energy, for example, for a simple cubic lattice with spacing a, 


€% = €y + 2M, (coska). 
Here 
My= ={i( r—R LH — Hy] d; ( oor R,) dr} <{X,, eer 2 (29a) 
where sites 7 and j are nearest neighbours. We note that the bandwidth 
12.M,, and the effective mass m* of the hole (m* =/?/(0%«,/0k?) are indepen- 
dent of temperature, since (X,,|X,,) in the Bloch scheme. In a similar 
way, we can use the functions ¢, X,, to describe a polaron with wave vector 


b= ay f(r —R,) X,, exp (7kR,), 


Ex = €y9 + 2M cos(ka), 


and energy 


where now 
M=NMz {X,, | X,%). 


The polaron bandwidth and effective mass are temperature dependent, 
since the overlap (X,,|X,,”) of the phonon functions is not equal to unity 
but is a function of the number n of phonons present. We may estimate 
the magnitude of this overlap at a temperature 7’ by taking an averaget 


value: 
{X,9|X, 9) ={ TT (Xn, (Q.)| Xn, (Qo?) ave 
= 1 @sinh( (iw (2k) } Sexp{—(n, +3) ho [kT}<{X., (@®) Xn, ( (Q®)) 


no 


=e-%, Bae Vee in. Ga Menem Rhye) cat whe (OO) 


where 
S=1> (CO-CO)w, (20, + 1)/h, 


and 7, is the mean number of phonons in mode o at temperature 7. The 
polaron effective mass m* is therefore larger by a factor eS than that 

calculated from the Bloch tight-binding model, and increases with tem- 
perature. This is because the moving hole must carry with it the polarisa- 
tion of the surrounding lattice. Yamashita and Kurosawa (1958) have 
quoted values of S at 7'=0 (appropriate to Li, Ni,_,O ranging from 15 for 
the smallest value of x to 4-5 for the larger. “Thus the mobility of a hole 
propagating as a wave may be very small. 


ede MMPS AT on PT Reg ae a ed TD OT ee 
+ The averaging above can be performed by a method due to O'Rourke (1953). 
M2 
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The magnitude of Sis a suitable criterion for giving meaning to the terms 
‘weak’ or ‘strong’ coupling. The carrier-lattice coupling is strong if 
S>1, and conversely when S <1 the coupling is weak. Thisisa pes 
definition, for from (28, 30). 
S~}<(Al,/l.)2) 


where Al, ~C,/M? is the shift in the centre of oscillation of mode o, 
lL =(h/Mw,)"” is the mean amplitude of oscillation of mode o at 7’=0 and 
the averaging is over all modes c. Thus, for example, the polaron energy 
spectrum ¢, tends to that of an electron in the usual Bloch tight-binding 
approximation as S->0, as would be expected for small coupling. 

In the following paragraphs we shall indicate how the hole mobility 
is calculated when the states are localized, so that the conduction 
mechanism is that of a hole jumping to neighbouring sites with the emission 
or absorption of phonons. The probability per unit time that a hole jumps 
from site 7 to 7, with a change in the phonon state from n to n’, is 


W ins int = (277/h) | Mins jn’ |? 5 ( (Ein — & ae B . . . . (31) 
where the energies ¢;,,, are given by (29), and 
Min, jn’ = (p(r—R; R,)X n?| #n—- Ay |d(r—R,)X, M), 
To obtain the total transition rate W,; of the hole from 7 to 7, Win, j., must 
be summed over final phonon states n’ and averaged over n. Thus 
Wig= (27 /h)| Mp PX Pn [Xn | Xn DP d[Ae+ d(m9— Mo’) hg]. (32) 
n,n’ 


Here WM, is as in (29a), p, is the probability that the state n is occupied at 
temperature 7’, and for the energy difference <j, — €j,, the value (29) has 
been substituted, and - 


Ace=¢,—€;= (¢;| AU(r)|4;)— <$; | AU(r)|¢;). 


The evaluation of the summation in W,; has been performed by several 
authors eae are given by Varnnckiea and Kurosawa 1960). We have 


Via lM Pexp[— S(T th exp [iAet/i+G(T,t)]dt, . (38) 
where 
G(T, t)=4> {(2%, + 1) cosw,t+isin w,t} w, (C,0-—C)(CCW)2/h (34) 
and S(7’) is as derived earlier (30). We note that an expansion 
exp [G(7',f)]=14+G644G8+)...,0 05. (35) 


and a term by term integration over ¢ gives essentially the contributions to 
W;; due to zero, one, two . . . phonon transitions (since @ is proportional, 
through C,°, to the phonon-hole coupling constant squared). Q(T, t) can 
be seen to have a magnitude comparable to S for a range of t values around 
t=0. Iftis outside this range, G(7’,t) tends rapidly to zero. As we have 
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seen, S can be as large as 15 for Li,Ni,_,O. Hence if S is large enough the 
transitions in which phonons are emitted or absorbed will dominate the 
zero phonon rate, and as discussed in § 5 the electron will be ‘self -trapped’ 
at site?. Similarly, the higher terms in the expansion (35) (multiphonon 
transitions) will dominate the term linear in G (single phonon transitions). 

The evaluation of W;; from (33), which has been treated in the previous 
reference, is difficult and will not be discussed further here, since we are 
primarily interested in impurity conduction in valance semiconductors. 
We note that, assuming the Hinstein relationship, a hole mobility » may be 
calculated, 

p= (ea /kT)W;,, 

where a is the spacing of the Ni?* ions (the mean jump distance of the holes). 

We now discuss the case of weak coupling, appropriate to impurity 
conduction in valence semiconductors. We consider the motion of a hole 
bound to a donor atom (in n-type germanium with a small degree of com- 
pensation, say) surrounded by randomly placed charged donor and acceptor 
centres. By the same methods as in previous paragraphs we find that the 
strength of the interaction of the hole with the lattice depends on S, which 
can be estimated through the deformation potential constant H, (Bardeen 
and Shockley 1950). The ‘displacement’ C, occurring in S is, from 
(25-28), 


5 2G) 22 : 
CP= Wy = {¢(r— R,)| exp {it,(r — R;)}| 6(r— R;) > 
iTo 
EH oleh GayP yA a), he oo ae S88) 


assuming that 4(r — R;) describes a donor electron in an s level, with effective 
-Bohr radius a). If the phonons are in phase at site 1, the phases at site 7 
will be random; thus C’,%) can be neglected. The sum over o can be re- 
placed by an intergration, using a Debye spectrum (cf. Lax and Burstein 
1955). 


3 


s= (2) 00Purdo= A 
0 h o > 


May? (kOp)*’ 


(37) 


where Reeves 
and @y is the Debye temperature. Using constants appropriate to n-type 
germanium, H,=15ev, a,=444, @)=362°K and M=2-46 x 10- g (twice 
_ the mass of the lattice atom since there are two atoms per unit cell), we 
find S=0-10. At finite temperatures, 


: h T\2 
S(T) = S, [ GlesePeoth 0 2 dw ~ 8(0) {1+3(5) | (38) 


2kT' 
when 7'/A is small. Here, if v is the velocity of sound in the crystal, 
kA =hvla,y is the energy of a phonon with wavelength comparable to dp; 
these phonons interact most effectively with the localized electrons. For 
germanium, using v=5:3 x 10°cmsec™, we find A~ 9°K. Hence S(T) 
is small throughout the temperature region in which impurity conduction is 
important (7 S5°K) in germanium. 
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The transition probability W,; can be calculated in a similar manner to 
that for the strong coupling case. We study the motion of a single electron 
in a random lattice of N+1 positive and NV negative centres. This is 
equivalent to n-type impurity conduction, in which all except one electron 
are assumed to be stationary. The electron Hamiltonian is 


KH = p?|2m* + Yv(r—R,) — > o(r — R,) 
v a 
where 
v(r—R)=e?/k|r—R| 
and R, is the coordinate of a negatively charged centre, R; that of a positive 
one. The overlap between electron wave functions centred on adjacent 


positive centres must be treated carefully, since the conductivity will vanish 
in the absence of overlap. Let 


Uu;=u (r—R,) 
be the wave function of an electron on an isolated donor, and 
. (a; | uj) = Ay (49) 
be the overlap integral. Then (Léwdin 1956), the funtions 
r)= > Aj? u,(r) TAS Ra ee 
Jj 


form an orthogonal set. The sum is over all positive centres. Matrix 
elements of #¢ in terms of the ¢; are 


<$;| He |; 2S °6; i Viz» 


where ¢° is the energy of an isolated donor electron, and 
= »9 Ay 1? Via Aree . . . . . . (40) 
kyl 


V,,/° is the matrix element of the electrostatic potential of all centres except 
the /th positive one; so that 
Vigo = (u,(r) | So(r —R,) — pers R;)| w,(r) >. See at A 
v tl 
Thus the ¢; describe electrons with energy ¢;, 


g=et+ Vy, 


the energy «, arising from the potential of the surrounding randomly 
placed charged centres. Since the overlap A,; is small, 


bi(r) ~ u,( r)—3A, U; (r). 


ieee 
Thus ¢, is mainly localized about the ith positive centre, and the off-diagonal 
elements V;; lead to transitions of the electron from i to j. 

The electron-lattice interaction can now be treated in exactly the same 
way as for the strong coupling case. We find that associated with the 
state ¢,(r) is a set of lattice vibrations described by the oscillator functions 
X, (Q.) ; n, is the number of phonons in mode co. The displaced coordin- 
ates Q,.” are by (28) 

Q,°=Q,+ 0.9, 
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where Cis given by (36) to the lowest order in the overlap. The expres- 
sion for the transition probability W,; is therefore of the same form as (33) 
with V;; replacing 1; 


> 


l 
Wy= mat V,, Pexp{—S(T)} [exp (G(T, t) +2 (e—e;)t/i}dt, . (42) 


where G(7’, t) is given by (34) and 
e;— €;= Vig— Vij ~ Cu, | v(r—R,)| u;,) — Ay Cu; | o(r — Ru,) 

to the lowest order in the overlap and neglecting all but the nearest- 
neighbour centre 7. Since S(7’) and therefore G(T’, t) is small, the single 
_ phonon contributions to W,; dominate, and thus in (42) exp@ can be 
replaced by 14+ G. 

Assuming a Debye spectrum of lattice vibrations and performing the 

sum over modes o as previously indicated for S(T), 


1 cs ‘ 3 ee ; 
Wi ral V,; |? ie . dt exp {t(€;— e;)t/h} —— | _ da exp (it) 


iy) 
am Sue) 


w? hw 
ES OO) GON) 7 faa Ses, 
x oF: : | ( +coth =r) 
2 — . 
=3r|V, piesen eae PRI ASS 


Mhv*(kOp)*® [1+ ((e;— €;)ao/2vhs? 4 
Gite 
x {1-+eoth SI. SO ete 21 Ried nn ee eee Ce), 
We have used the'value of C,” calculated in (36). 
The two phonon processes, which involve an integration over {G(7’, t)}? 
give a contribution ~10-* times the single phonon transition rate (cf. 
eqn. 35). 


§ 8. CALCULATIONS OF ImpuRITY CONDUCTION 


The theory of impurity conduction has been studied by a number of 
authors: Aigrain (1954), Baltensperger (1953), Conwell (1956), Erginsoy 
(1950, 1952), Mott (1956), and Kasuya and Koide (1958). In this section 
we shall outline the recent work of Miller and Abrahams (1960) and 'Twose 
(1959) on conduction in the low concentration region in germanium and 
silicon. The former authors use the following method. 


(a) The probability per unit time W,, that an electron jumps to a neigh- 
bouring vacant site is calculated, as a function of the separation between 
the two sites. A phonon is emitted or absorbed to conserve energy, as 
explained in the previous section. 

(b) Equations for the rate of change of the probability that an electron 
occupies a given site and the net current flow, in an electric field, are written 
down and shown to be equivalent to Kirchoffs’ Laws for charge flow in a 
three-dimensional random resistance network. Lach link in the network 
corresponds to two impurity centres; the link impedance is inversely 
proportional to W;,, the transition rate between the centres. 
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(c) The network resistivity is computed, assuming that it arises from 
non-intersect chains of impedances taken in parallel, each link in the chain 
being chosen in a suitable manner. 

In evaluating W,; in the previous section we assumed a simple hydrogenic 
form for the donor electron wave function, 

u,(r) = (749°)? exp (— 1/49). 
The correct effective mass wave functions have been described in § 3. Using 
these wave functions, and averaging over the possible directions in the 
crystal of the neighbouring vacant site, these authors found 


Giperaes ee _ U2A {eoth (A/2k) + 1}, 


2p v? 
where 
U? = (2e?/3xa?)? (7a/40R)"?R?/n exp (—2R/a). 

Here p, is the density and v the velocity of sound in the crystal, and #, 
the deformation potential constant. The coth describes the number of 
phonons present with energy |A|, where A is the difference between the 
energies of the two centres. is the distance between the two centres and 
a=a?/b2—1; a, b are the Bohr radii and nm the number of conduction 
band minima (§ 3). 

If f, is the probability that centre 7 is occupied by an electron, 


j 
This determines the equilibrium distribution. As in § 2, 


f,=1/[1 + exp {(e,—)/kT}], 


where ¢ is the Fermi energy. From considerations of detailed balancing, 
we have 


W,, exp (€;/kT’) = W,,exp (¢/kT). 


In the presence of a small electric field F in the a direction, a different 
equilibrium distribution f,;/ of electrons is formed, determined by a steady- 
state condition of the form (44) but with f; replaced by f;’, and the transition 
probabilities which are altered by the field now obey 


W,; exp {(€;— ex, F)/kT}= W,,exp {(e,—exF)/kT}; . . (45) 


az, is here the x coordinate of the ith centre. f;' can be written formally 
(assuming local equilibrium), in terms of a parameter E,, as 


fi =1/[1+exp {(E,— C)/kT}}. 


Substituting this form for f;’ in the steady state condition, using the recipro- 
cal relation (45) and taking terms linear in F only, we find 


1/kT) Deg= ex,F) —(e;—ex,F)} Wf, l—fp= 


This is of the form of Kirchhofts’ first law for a network, with the term { } 
corresponding to a potential drop across the ‘link’ ij, and the link im- 


pedance, 
Zi x{Wiyf(1—-f,)}7. 
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The problem is thus reduced to determining the impedance of an equivalent 
resistance network. 

The solution of the network problem, in terms of chains of conducting 
elements, the chains being taken in parallel, is described in detail by 
Miller and Abrahams (1960). The final result for the resistivity can be 
written 


p(L’) =C(T)(rp/a){1 + 18-2(a/rp)?} exp {1-09(rp/a)3? + (5 — €e)/kT} (46) 
where 
O(T) = 4-55 x 1021o(T)(/8)¥2 «2np, v°H4a3 [eS E2 
and. 
fp=(3/4aNp)'?, a= —14+.a?/b. 


Here a and 0b are the radii of the effective mass wave functions, is the 
number of conduction band minima, p, the density and v the velocity of 
sound in the crystal. In the discussion above, the donor electron was 
assumed to occupy the ground state. Je and e€. contain the effects of 
excited states, which are shown to be unimportant (Je=1, «,=1) except 
in the case of antimony doped germanium (cf. §3). Thus p(7') is of the 
form 
p(7’) =Po (Nai) exp (e3/kT’). 


In this calculation the whole of the compensation dependence is included 
in the activation energy ¢€3, which is given by 


€,=C— 1:35 Ea, €a=(€7/K)(4rN,/3)48. 


The determination of the Fermi energy ¢ (on the assumption that the 
energy difference ¢;—«; between sites is due to nearest neighbour charged 
minority sites only) has been described in§2. Ifthe compensation is small 
(K 0-2) 

€3 = €p — 1-35 €, = (€?/K) (40 N p/3)"8 (1 — 1-35K 13). 


For higher values of K, values of «, are plotted in fig. 8. Although, as we 
have seen in § 4, the magnitude of «, agrees well with experimental values, 
the temperature dependence of p(7’) is incorrect since it predicts that all 
curves plotting p against 1/7’ for the same concentration of majority 
impurity centres extrapolate to the same point at 7’= 0. 

Calculated values of p from (46) for antimony-doped germanium are 
compared with measured values in table 3. The agreement is satisfactory, 
considering that errors of at least 30% are possible from uncertainties in 
the effective Bohr radii and the deformation potential constant F). 


Table 3. Resistivity p of germanium doped with antimony (Q cm x 10°) 


Np x 10-15 em-3 6 2-3 . 
-8 x 108 3-2 10? 50 
‘1x104 | 5-2x10? 89 


1 
p (2°5 °K) . 5 
p (calculated) 1 
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Twose (1959) has also calculated the impurity resistivity in the low 
concentration region by a somewhat different method. A density matrix 
approach was used, similar to that of Luttinger and Kohn (1958), using as a 
representation the product ¢,(r)X,,(Q) of electron and oscillator functions, 
described earlier (§7). This leads to an average electron drift velocity <v) 
which is a sum of ‘two centre’ contributions v,;. Each v;; depends on the 
charge amplitudes of the electron on sites 7 and j, and hence <v) should 
be found from a self-consistent treatment of charge diffusion onto and 
away from a given centre. Instead of this self-consistent approach, the 
electron was assumed to occupy sites with a Boltzmann probability, leading 
to a two centre conductivity for Ve electrons of the form 


Oi; = Ne F(R) (L, €;j)- 
This must then be averaged in suitable way over the random separations 
R,; of the impurity centres. Here 
F( Rj) = {87H Ph?/Ma,? (kOp)*}| (4:10 14; P, 


where J is the mass of the atom, ©) the Debye temperature, and ¢ ) is the 
matrix element of the velocity between the localized electron wave functions 
on centres i andj. The temperature and energy dependence of a is con- 
tained in the equation ' 


a (e;— €;)/2kA 1 €;— &; 
Nef a= Tae mRARFED O88 Seep 
exp(—¢/kT)/Sexp(—¢/kT). . . . . . (47) 


e, is here the energy of an electron on site i, kA=hv/a, is the energy of a 
phonon of wave length comparable to the Bohr radius a, of the localized 
electron, where v is the velocity of sound. Since ¢;—«; is independent of 
R,; except for very small separations, the dependence of o,; on separation 
is almost entirely contained in the term |<{ ¢;|v|¢,)[?. 

In the case of small compensation, f(7', #) can be approximated by use 
of the trapping model (§2). The electron jumps between the ‘free’ sites, 
where ¢;~ €;, and the Boltzmann term in (47) 


Ne exp (— ¢(kT)/¥ exp (—¢;/kT), 
leads to the number n, of free carriers, 
= (Minin may)! exp (— €t/2kT7), 
where ¢; is the energy difference between free and trap sites. Then 
F(T, €)=n,[kA. 
The conductivity becomes 


py = (Bre Bh May v(kOp)}| (6; |v 1442. 


Thus the mobility is independent of temperature. 


where 
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Assuming hydrogen-like wave functions for the localized electrons in 

evaluating the velocity matrix element, it was found that 
me Hi? 

°F tM (Ope MY EXP 2%) Mas N nn 
where v= R/a). When averaging o, over the random centre distribution, 
care must be taken not to over emphasize the contribution from pairs of 
very close centres, since the electron will tend to be trapped on such a pair 
rather than move on through the lattice. Onerough estimate of the average 
conductivity ¢ is obtained by assuming a constant impurity separation. 
Figure 17 shows that the observed resistivity is best fitted by 1/é if 


)Y? exp (— e¢/2k7') (48) 


Fig. 17 


resistivity (ohm-cm) 


(@) |2 |4 16 ifs} 20 BP) 24 
| /(N'3a.4) 


average centre separation v of (a) N—1, (b) 0-7 N—1/ and (c) 0-62 N18 
The crosses are experimental points (Fritzsche 1958). 


The resistivity for n-type germanium calculated from (48) assuming a constant 


v~0-7/N8a,, implying that conductivity takes place preferentially along 
chains of impurities whose spacing is less than average. In evaluating ¢ 
in fig. 17 we have set $e for ¢,, the observed activation energy. 
Another averaging procedure, suggested by Pippard, is the SOuONang 
Suppose that a pair of centres, separated by a distance and with ‘ two 
centre ’ conductivity o(R), can be replaced by a sphere of radius F and a 
uniform conductivity o(R). Let this sphere be imbedded in a medium 
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with the average conductivity & of the disordered lattice as a whole. When 
a field F is applied, the field inside the sphere becomes 

F(R) =36F |{26 4+ o(R)}. 
The current density inside the sphere is F(R) o(R). If the probability of 
finding a sphere with radius between R and R+dR is P(R)dk, then the 
average current density j is given by an integral equation 


3G0(R) 
= cere ities 
ae {7 PUR) Se a(R) : 


For a random impurity distribution the probability function is 
P(R)=4rN,,,; R? exp (— 42 N,,,,; R?/8). 


maj 
The equation was solved numerically. 

Calculated values of the resistivity p are compared with measured 
values pexp for antimony doped germanium (Fritzsche 1958), at a tempera- 
ture of 2-5°K, in table 4. The observed activation energy ¢, has been used 
to estimate et, through the relation «,=4e, given by the trapping model 


(§ 2). 


‘ 


Table 4 


Np cm p (Q. em) Pexp(Q . em) 


9-3 x 1014 ‘Ol: 6-5 x 1014 7-1x10"4 
1-6 x 1015 . 5-2 =x 109 6:3 x 10° 
2°3°x 1015 . 7-4x 108 3-2 x 108 
3-0 x 1015 . 1-8 x 108 7-1x107 
5-2 x 1015 . 2-3 x 10’ 5-6 x 108 
8-5 x 1015 : 1-4 x 108 1-4 x 108 
1-3 x 1016 . 6-6 x 104 2-2 x 105 


Both the above calculations are based essentially on choosing an average 
transition rate W,; of the electron between two centres. The averaging 
procedure is critical, since W,; increases exponentially with decreasing 
centre separation. As mentioned earlier, the large transition rate between 
two centres spaced closer than the average does not imply there is a 
correspondingly large contribution to the average conductivity, since 
the electron may have difficulty in proceeding from j to more distant 
centres. The charge on j will then build up by an amount which depends 

on the charge distribution on surrounding centres and on the applied 
electric field, until the forward transition rate iJ is balanced by the back 
transition rate 77. 

Another point is that if f; is the probability that centre 7 is occupied, the 

ee that the nearest neighbour j is unoccupied will be larger than 

—f;), due to Coulomb repulsion between the electrons. Miller (private 
ee aaans points out that this neglect of correlations between 
electrons may be the cause of the incorrect compensation dependence of his 
results. To conclude, a rigorous calculation of the conductivity will 
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involve setting up a transport equation in which the charge distribution in 
the presence of an electric field is treated self-consistently, and with carrier— 
carrier correlation taken into account. A part of this programme has been 
completed by one of us (W.D.T.) in that the correct transport equation has 
been derived in the approximation that correlations are neglected; as 
might be expected the equation takes the form of a generalized Einstein 
relationship. The inclusion of the important self-consistency and correla- 
tion effects is being investigated. 


Part IJ]. THe Transirion to a Merautic Form or Conpvuctriviry 


§ 9. InTRODUCTION 


It is a property of silicon, germanium and of many other extrinsic 
semiconductors that, as the concentration N of impurities increases, the 
activation energy e, for conduction in the temperature range for impurity 
conduction decreases and, at a critical value N,, vanishes. For values of 
N greater than NV , the resistivity and Hall constant are roughly independent 
of temperature down to the lowest temperatures for which measurements 
have been made. 

In a number of papers one of us (Mott 1949, 1952, 1956, 1957, 1961) 
has given arguments to suggest that a sharp transition from a metallic to 
a non-metallic state must occur for a crystalline array of atoms as the 
distance between them is increased. In the paper published in 1956 it was 
suggested that the transition described in this paper for a random array of 
centres is of this type, the sharpness being lost because of the disordered 
arrangement. Weconsider that this isthe correct explanation. However, 
another explanation is certainly possible, namely that the transition is one 
due essentially to the disordered lattice and occurs at the concentration at 
which the states for a single carrier become localized. We shall have to 
examine the evidence that this is not so at any rate for low K. 

We shall not repeat the arguments summarized by Mott (1961) that this 
transition occurs for a crystalline array, remarking only that, while an 
exact calculation of the transition concentration has not proved possible, 
it should occur at a constant value of about 3 of the constant 


where a, is the Bohr radius of the centre. 

We shall now summarise what we think ore in the disordered 
lattice in the two limiting cases, K <1 and 1—K <1, where K is degree of 
compensation. - 


(2) Low compensation, K<1. In the region of low concentration, 
(N/N <1), the carriers are holes, bound to minority centres by a Coulomb 
field e2/xr?. As N increases, therefore, the holes cannot become free, 
because a Coulomb field always leads to bound states. The transition 
concentration NV, should increase slightly with compensation K, because 
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some of the majority centres are empty and so the amount of overlap 
between localized centres is decreased (see below). 


(b) High compensation, 1—K <1. Here we have at present little experi- 
mental evidence. The theoretical predictions are the following. As 
the concentration V of majority carriers is increased, a value will be reached 
for which the electrons are no longer in bound states (§ 2). Thus n electrons 
per unit volume (where n =(1—K)N) move in the random field due to fixed 
positive and negative charges; this random field is thus not strong enough 
to give bound states. There is thus the possibility of a ‘ crystallization’ of 
electrons as predicted by Wigner (1938). This is discussed by Mott (1961). 
We are convinced that, whether or not the electrons behave like a classical 
liquid or a condensed electron gas, their conductivity will be high and that 
in this case any transition which may be observed will be due to a transition 
from bound to free states for a single electron. 

The remainder of this paper will deal with the case of small or moderate 
value of K, for which most of the experimental work has been done. 


§ 10. DEPENDENCE OF THE TRANSITION CONCENTRATION OR DEGREE OF 
COMPENSATION 


If the transition were due essentially to the properties of the random 
lattice, we should expect it to depend very sensitively on K, since K 
determines the random field. If on the other hand the transition is due 
simply to overlap between occupied centres, we should esnec a variation of 
A according to the formula 

A= const (1— K)", 
Some experimental support for this compensation-dependence of A is given 
by measurements of the acceptor separation rs at which the activation 
energy ¢; disappears in p-type gallium-doped germanium (Fritzsche and 
Cuevas 1960b). The results are summarized in table 5, and are shown in 
fig. 18. 
Table 5 


ri. (2) 127 | 98 


K 0-04 . 0-4<K <0-7 


The critical separation decreases as K increases. We also note that the 
ratio of the separations for K equal to 0-04 and 0-4 is 1:18, while that 
calculated from (48) is 1-12. 

The experimental value of d is not clear, since the gallium wave functions 
are not accurately known. A variational calculation shows that at large 
distances from the impurity the wave function is a sum of two exponentials. 
One has a Bohr radius a,=35+7A, the other ag=89+2A4 (Miller and 
Abrahams 1960). Then for K =0-04, 


Ay=1s/a,=3°6, Ag=rs/a,=1-4. 
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Since the relative amplitudes of the two exponentials are not known, we 
can only deduce that 1-4<\<3-6. However, A is probably closer to 3-6 
than 1-4. One reason for this is that in the alkali metals (where the 
conductivity is of course ‘ metallic’) the inter-electron separation in units 
of the Bohr radius range from 3-22 (lithium) to 5-57 (caesium). 


Fig. 18 


€,(T>0),milli-ev 


0.5 


~ 


€,, (Miller) y 


(0) DOO Piers 1,000 
Average Separation d=0,62 N~3 Angstrom Units 


Activation energy «, of impurity conduction of transmutation-doped p-type 
germanium as a function of average impurity separation for K=0-4 
and K =0-04 (Fritzsche and Cuevas 1960 b). 


§ 11. CoNnCENTRATION AT WHICH THE TRANSITION OccURS 
In fig. 19 we show the resistivity at 2-5°K of weakly compensated samples 
of n- and p-type silicon and germanium. ‘This is plotted against A, in 
other words (3/47N)"8/ay or rs/a, where rs is the mean distance between 
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impurities. The Bohr radius a) used here is obtained by assuming the 
hydrogen atom model for an impurity centre, and adjusting a, to fit the 
observed ionization energy of the impurity centre. The activation energy 
e, drops rapidly in the transition region (shown in fig. 19 between the 
vertical dotted lines) and occurs for all substances roughly when A~3, 
or N4%q, = 0:2. 
Roughly the same value is deduced by Dewald (1960) from the measure- 
ments of Thomas (1959) for zine oxide. 


Fig. 19 


resistivity (onm-cm) 


D7, Ao Be LB Wye ia 
0:62/(N'3q,) 


Variation of resistivity with average impurity separation for weakly compen+ 
sated specimens of germanium and silicon. The transition region is 
‘enclosed by the dotted lines. The activation energy ¢, for impurity 
conduction vanishes at the small separation end of the transition region. 


MclIrvine (1960), by examining a number of semiconductors, deduces an 
empirical formula which relates V, to the static dielectric constant e. 
However, for semiconductors that are not elements, it is not clear what 
value of the dielectric constant one should take, the high frequency value, 
the static value or some mean between them; this point needs further 
investigation before an assessment of the meaning of McIrvine’s formula 
can be given. 

What is happening in the transition region is not clear. We have seen 
(fig. 2) that the ratio of the Hall constant to 1/N maj drops in this region. 
This suggests that the material is inhomogeneous, due to fluctuations in the 
density of centres, and that there are small metallic regions in series with 
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non-metallic ones, whose relative volumes depend on temperature. If 
the Hall effect in the non-metallic regions is small or non-existent, one 
would expect, as the temperature is lowered and the volume of the non- 
metallic regions increases, a drop in the observed Hall constant. This is a 
possible explanation of the effect. ; 

It should however be pointed out that Read and Katz (1960) have ob- 
served a Hall effect for ionic conduction in KCl; the hopping process of a 
come defect from one centre to another is not dissimilar to that described. 

ere. 


§ 12. Resistiviry IN THE Recion oF Metatiic ConpuctTION 


The conductivity of a metal may be written 
| ONG ie Ontg et eR ay ee ahh thoes (4D) 

where J, v are the mean free path and velocity at the surface of the Fermi 
distribution. Moreover, for a degenerate electron gas, 

m*v|hi = 2n(3N [S7)*3, 
Also it is convenient to write 

| l= p/N13, 

so that p is the number of interatomic distances on a mean free path. Then 
(49) becomes 

o = N13 ep /h(3/7) 8, 
or in ohm~ cm“, if NV is in particles per cubic angstrom, 

o=7x 103 pN12, 

For the specimens with highest conductivity shown in fig. 4, N is 10-6 
and the observed conductivity is 1-5 x 107, so pis about 2. This is shown 
also in the following table (Fritzsche 1960, private communication) for 
n-type germanium, in which v is deduced from (50) and hence the mean free 
_ path from the observed resistivity p. It is of course assumed that each 
centre contributes a free electron, so N=N,. The last column shows the 
mean distance between centres. The table also shows that / tends to 
remain constant as NV decreases, so p apparently drops below unity. 


Table 6. 


X 


p (obs, Q.cm)} Np (cm-%) | v (cm/sec x 10~8) | 7 (em x 108) 0-62 N-18 
\ (cm x 108) 


0-02 ral RUE 6 62 106 
0-0095 Geli". 8-6 63 73 
0-0067 1018 10 64 62 
0-0033 BOC LOre 15 63 43 
0-00157 Tou 22 59 29. 
0-001 2. 107% 28 58 23 


0-00066 4x 1019 35 55 18 


P.M.S, 


156 N. F. Mott and W. D. Twose on the 


We should expect / to be given by a formula of the form 
1t=N | (0) (1—e0s8) Sein hdd. oan 
where /(0) is the differential cross section for scattering by each centre. 


Fig. 20 


HALL CONSTANT R 


Tr 


Hall constant near transition concentration in metallic region. 


Fig. 21 


CONDUCTION BAND 


FORBIDDEN BAND 


An impurity band for concentration near the transition point. 


That, in the metallic region, the integral is of the order of N-28 ig perhaps 
not unexpected for a totally disordered lattice. The constancy of 1 
however, means that the integral is proportional to 1/N, and this is aie 
expected. A crude application of Born’s approximation would suggest 
that the scattering potential energy function of each centre was of order 
e?/«R (47-R3/3 = N) extending over a volume R?, so that 


1 2m e2 


Tyas pe (expicsle : 
tell Ge iz | exptik pow al he 
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the integral being over a sphere of radius R and |k—k’|=2ksin40. Since 
KR is independent of NV. this must be equal to a numerical factor multiplied 
by R*/a,?, where ay is the hydrogen radius h?«/m*e?. Thus / should vary as 
R-,i.e.as N83, , 

The observed constancy may perhaps be due to the non-Coulomb part of 
the field near the impurity atom, the relative importance of which will 
increase with JN. 

A point about the conductivity near the transition point may be pointed 
out. The Hall constant plotted against 1/7’ shows the behaviour sketched 
in fig. 20 (cf. also fig. 4). This suggests that there is an ‘impurity band’, 
formed from the impurity wave-functions, and not yet overlapping the 
conduction band. If the impurity wave functions are hydrogen-like, it 
is easy to estimate its width. This is, using the approximation of tight 
binding (Mott and Jones 1936, Chap. III, Mott 1957) 


B=6W,(1+’)exp(—2’) 
where W,, is the ionization energy of a centre. The assumption is made 


that the centres are arranged in a simple cubic lattice of side b, and 2’ 
isthusb/a,). Values of the ratio of Bto W, are 


nN B/W, 
2 3°0 
3 1-2 
4 0-54 
5 0°24 


Only values less than unity have any meaning in this approximation. The 
experimental value of A at the transition point is c. 3, and if we set 

N = (47/3)48A = 1/ Nay, 
a value of about 5 is appropriate for \’._ Near the transition concentration, 
then this rough estimate suggests, as do the experiments, that the impurity 
band does not yet overlap the conduction band. . 
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APPENDIX 


Tur ExIsteNce of Bounpb States IN THREE DIMENSIONS 


We give here an outline of an extension due to one of us (W.D.T.) of the 
work of Anderson (1958) showing that the single electron states are bound 
in an impurity lattice when the average impurity separation greater than a 
critical value ; the degree of compensation is assumed close to unity. 

N2 
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We study the wave function of an electron in a lattice of positive and 
negative fixed point charges which are distributed at random in a dielectric 
medium. There is a density N of positive charges, and the total number of 
positive charges exceeds the number of negative charges by one. Let 
¢;(r) be the wave function of the electron localised on the ith positive site. 
For simplicity, we here assume that ¢; is a hydrogen-like s function of the 
form 

(7) = (a9?) *? exp (— 1/9), LOS Ae es RY 
and that ¢,, 4; are orthogonal; we thus neglect the small overlap (4; ¢,). 
(A more exact treatment can be given in terms of the orthogonalized 
¢; of §7, (eqn. 39). The wave function of the electron at time ¢ can be 


written 
¢ p(t) = 2 aj(t) (7). 

We assume the electron 1s initially localized on atom 7, so that a,(0)=1 
and a,(0)=0, and study the variation of a,(t) with ¢ using the Schrodinger 
equation : 
th da,/0t = Ha, + > Vj; 4;. Thy Se eee aa) 
Here H, is the energy of the electron on site 7 in the coulomb potential 
of the surrounding positive and negative charges. Vj; is the matrix element 
of the potential of all charged centres except the jth positive one: 


Vy= (#| 3a - Pa x|r—R,| $i), 


kj 
R;,, R, are the positions of the positive and negative centres respectively. 
It is convenient to study the Laplace transform of (A 2), defining 


fi(s)=s | : exp (—st) a; (t) dt. 


Thus s is an inverse time, and 
lim a,(t) = lim f,(s). 
t>0oo 80 


Then, it follows from the work of Anderson (1958) that the behaviour of 
a,(t) at large times depends on the convergence of an infinite perturbation 
series 
V (8) ian > Vix (1/d;,){Vig+ pa Viem(1/dm) Vena ae ae (A 3) 
where : 
d,,=iths — E,.. Beery hie orien ey 
For, if ¢ is large, Anderson shows that 
a,(t) ~ exp (—t/7;) exp {—7(H;— A))t/h}, 
where 
lim V (s) = — A;+irfi. 
8>0+ 
The imaginary part of V, corresponds to a decay time, and the real part to 
an energy shift. Hence, if the Im (V,(s)) vanishes as Re (s) > 0+, the ampli- 
tude of the electron on site i remains finite as t-> 0, i.e. the electron is bound 


Theory of Impurity Conduction 159 


to that site and has energy H,—A,._ If the imaginary part does not vanish, 
then after a sufficiently long time the electron will have diffused completely 
away from its initial position; bound states are in general not possible at 
any centre 7. 

V (s) should be evaluated when 
Im (s) = (#;— A,)/h. 

Thus in (A 6) 

where sisreal. In the following we neglect the small energy correction A, 
and write 

The quantities V;; and H,; entering V, can be written 


2 | 
ie (bi v(t) - car $; ) shuscasiey at CAEN D) 
Ei; = Uy (R;) — V3 (R;), 


I 


ny(r)=erfe|r—R,|—_¥ ete] r— Ry 


is the Coulomb potential at r of all the randomly placed positive and 
negative centres except the ith and jth positive ones. 

The dominant contributions to V, come from small values of £;; and 
large values of V;;, From (A 1), 

Viy~exp (—B,;/a9), 
where &;; is the separation of i andj; thus JV;; is largest when 7 and 9 
are near-neighbour centres. In general, for two near neighbours, H,,; is 
small when there are no close centres to? andj. Then, it is a fair approxi- 
mation to neglect v,;(7) in (A 5), taking 
e2 


__ & (Bi 
Vise (4; x|jr—R| $; ) aE (3 2 1) exp (Fj;/a9). 


Ky 
Also we note that in this approximation V;; and H,; can be treated in- 
dependently, since V;; depends on centres 7 and j only, while #,; depends on 
the potential of all other centres. 
Let 


where 


V.= —A(s) + ths X(s), dees. 7 aC) 
where A, X ands arereal. Then, in the limit s—0t, 
X=> (Viel Bix V teil Bat > V tem V mil Liem Het +++» (AT) 
k m 


We treat the V;; and H,; as independent random variables, obtain a pro- 
bability distribution for X and show that the probability that X diverges to 
infinity is zero if the density N of charged centres is less than a critical 
value V,. Then (cf. A6), 

lim Im V ,(s) = 0, 

s>0 


and the electron remains localized about site 2. 
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The steps are the following : 
(i) Anderson shows that the first term in (A 7), 
aes 2 | Vix [?/ Bin? 


is convergent in the above sense if V;,(R;;) falls off faster than 1/R,,°** 
(e€<0). This condition is obviously satiated for our case. 


(ii) We must now show that the complete series V is convergent when 
N<WN,. Let 
SPACE S,=2(+T;) 


where S, is the sum of all terms of length L; the ‘length’ is the number of 
times VjE appears in the term. The sign of both S, and 7’, is random, 
because of the random sign of the energy Aatamiatars E;;. The number 
of terms of length Z, with value 7', in the range 7,—7,+d7',, is found 
to be of the form 

n(T',)dT ,=[f(N) L(7,) aT ,/T 77, 
where f(V) increases with NV. If Z(7’) increases, or decreases no more 
rapidly than 7'-1?, then the probability distribution of S, is of the form 


P(S7)dS;~ [Lf(N) L(S;) dS;/S;, 
for values of the sum S, greater than or of order the most probable value. 
Since we are interested only in the convergence of X, we consider the case in 
which ZL is large.. Then Vis determined by 
Lf (NV) £(1)=1. a oes et Sree Ol eee 

For, if N<WN,, f(N)<f(N,) and the probability of obtaining S,=1 is 
smaller by a factor of order e~. The number of S, in X increases as L, 
and the probability is 1—e-” that each term is less than unity. Hence X 
must almost always converge when N < J... 

The detailed derivation of n(7’) will not be given here. It can be 
obtained by the methods in Anderson’s paper, with the following assump- 
tions. 


(a) The probability distribution of the energy differences E;; is of the 

form 
P(E,;)=W if -4W<H,<3W 
=(0 if |H,;|>4W, wy cS cei oy ae ae 

(b) Viz= Voexp (—R,;/ap), ys anlios ote wh ete A Lae 
The number of terms V;; with values in the range V—V-+dV is obtained by 
assuming that the number of positive sites j at distance R,; from i is 
Then we obtain 


N(T)aT = oe : (1+In(7/4L)—4In(2V,/W)}4dT/T2; (A 11) 
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W can be determined as follows. We write 


8} 
Ve es ian MAW 
EA Bae Bi 


and k is any one of the positive or Pie centres. ‘Treating Z, as a step 
in a random walk problem, the probability distribution of Ei; is 
_ (Chandrasekhar 1943) 


6 hh , 
P(e)= =| exp ((H#r) exp | — 4x7 | sin {< (F — Fe zh? Ry. | dr 
=e 2 ik ik 
set haere: 


As discussed earlier, we are interested only in small values of ZH, when 
Ry, or Rj, > Rj. Hence we can approximate (A 16) using 


1/R,;—1/R,; = R,; cos 6/R? 
where F is the mean distance from 7 and j to k, and @ is the angle between 
R;; and Rk. Then 


] A 
PH) = 5 | exp (ir) exp{—4N (me®R,,/2x)®2 | [*%dr, 


where 


and 

P(0) = 21(5/3)«/{7?(4.V)?!9 e?/R,,\. Tein sea een ata G2 GEE 
The appropriate value of W to use in the rectangular distribution (A 13) is 
therefore 

W =1/P(0) =8-5e?/Kagn, 
where 47Na,3/3 = 1/n?, and n is the average separation in units of the Bohr 
radius. We have replaced &,, in (A 17) by its average value, 
= (3/47N)18 
Hence the critical value of n is determined from (A 12) and (A 15) by 
(5-46 — In n)/n? < 0-32, 
and thus on evaluation by n < , where 
N,=3°6. 

Our model corresponds to n-type germanium or silicon at the absolute 
zero of temperature, with degree of compensation K~1. The variation of 
n, (the separation of the donor impurities) with K is given by 

{5:46 —In (n/K13)\ n2 K¥3 = 0-32, 
since the energy spread W is determined by the number of acceptor sites, 
while the jump distance of the electron depends on the separation of donor 
sites. Thus 7, increases as K decreases from unity. 

A weakness of the above treatment is that V,, and H;; are treated com- 
pletely independently, whereas from(A 14) and (A 16, 17) it can be seen 
that both are functions of R,,. Thus we should carry through the treatment 
using the single quantity V/E as the random variable. Corrections to the 
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above treatment are being investigated. We believe that the value n,=3-6 
is an upper limit to the critical separation. For, we have allowed a finite 
probability for H;; to have the value zero for all separations R;;, However, 
for two very close centres i and j there should be a term 2V;; i in the energy 
difference H,; which prevents H;; taking a zero value, and therefore the 
series X (A 7 ) should converge at smaller values of the average separations 
between centres. 
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§ 1. INTRODUCTION 


Iv is well known that there are attractive forces acting between any two 
neutral atoms or molecules which are separated by a distance large 
compared to their own dimensions. These are known as van der Waals 
forces and are of a long-range nature: they decrease with distanc 
according to a power law and not as an exponential. 
These van der Waals forces are electromagnetic in origin. As was 
first shown by London (1930), they arise from second-order perturbation 
theory applied to the electrostatic interaction between two dipoles; the 
energy of the interaction being proportional to R-*. However, this 
approach is only valid so long as R is much less than the wavelengths A of 
the corresponding transitions between the ground state and the excited 
states of the atoms. For R>A retardation effects become important. 
The interaction of atoms taking retardation into account was considered 
by Casimir and Polder (1948) as an effect of fourth-order perturbation 
theory applied to the interaction between the atom and the electromagnetic 
field. (These calculations were repeated by Dzyaloshinskii (1956) using 
the modern invariant technique of Feynman.) In the limiting case 
RX the interaction energy is proportional to R~’. The existence of 
attractive forces between neutral atoms gives rise to analogous forces 
between any two macroscopic bodies whose surfaces are separated by 


+ Translated by M. G. Priestley. 


166 I. E. Dzyaloshinskii et al. on the 


very small distances. However, the calculation of these forces on the 
basis of the interaction between individual atoms (as this is usually done) 
is impossible. It would be valid only for sufficiently rarefied bodies, i.e. 
gases—a case which can have no real counterpart. In condensed bodies 
the close packing of the atoms materially changes the properties of their 
electronic envelopes, and the presence of some medium between the inter- 
acting atoms alters the electromagnetic field through which the inter- 
action is effected. 

However, as distinct from this ‘microscopic’ approach, one may 
approach the problem from a completely different and purely macroscopic 
point of view, in which the interacting bodies are considered as con- 
tinuous media. This approach is valid because the distance between the 
two surfaces, although small, is large compared to the interatomic distances 
in the bodies. 

The basic idea of the theory is that the interaction between the bodies 
is considered to take place through a fluctuating electromagnetic field. 
Because of the thermodynamic fluctuations this field is always present in 
the interior of a material medium, and it also extends beyond its boundaries. 
A well-known consequence of this field is the thermal radiation of a body, 
but it should be emphasized that this is not the only manifestation of the 
fluctuation field outside the body. This can most clearly be seen already 
from the fact that electromagnetic fluctuations exist at absolute zero, 
when there is no thermal radiation; at this temperature the fluctuations 
are of a purely quantum character. 

As well as the attractive forces between bodies placed close together, 
the same approach can also be used to study other effects in condensed 
bodies which are due to van der Waals forces, in particular, the properties 
of thin liquid films on the surface of a solid body. 

In a thermodynamical sense, all these effects show one general feature : 
they are all related to the non-additivity of the free energy of a system of 
bodies when van der Waals forces are taken into account. In all these 
cases the free energy is not simply proportional to the volume of the 
system, but for a given volume is also a function of the parameters which 
describe the relative positions of the bodies (for example, the distance 
between solid bodies or the thickness of the film.) It is this non-additivity, 
due to the long-range nature of van der Waals forces, which is a qualitatively 
new effect, which separates the contribution of these forces to the thermo- 
dynamic quantities from their much larger additive part. This non- 
additivity can easily be understood by considering the above-mentioned 
relation between the van der Waals forces and the fluctuations in the 
electromagnetic field. Any change in the electrical properties of the 
medium in a certain region will, by Maxwell’s equations, lead to a change 
in the fluctuation field which extends beyond that region. Therefore the 
part of the free energy which is related to electromagnetic fluctuations is 


not determined by the properties of the substance solely at the point 
considered, i.e. it is non-additive. 
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It should be made clear that when we speak of the fluctuation electro- 
magnetic field, we mean by this all the spectral components which have 
wavelengths which are large compared to atomic dimensions (we shall 
refer to these as the long wavelength fluctuations). In each case the 
important fluctuations are those whose wavelengths are of the same order 
of magnitude as the inhomogeneities of the system (e.g. the thickness for 
a film and their separation for the case of two bodies). All the properties 
of the long wavelength fluctuations, including their contribution to all 
thermodynamic quantities, are completely specified through the complex 
dielectric permeability of the body. 

In this way it is possible to construct a general macroscopic theory 
of van der Waals forces which has the sole limitation that all the charac- 
teristic dimensions of the bodies must be large compared to interatomic 
distances. In principle this theory is applicable to any bodies at any 
temperature, independent of their molecular nature (ionic or molecular 
crystals, amorphous bodies or liquids, metals, dielectrics, etc.). Since the 
theory follows from the exact equations of the electromagnetic field, it 
automatically takes account of retardation effects. 

A theory of the attractive van der Waals forces between bodies using 
these principles was first constructed by Lifshitz (1955). Application of 
the methods of modern quantum field theory made it possible to find the 
general formulae for the calculation of the van der Waals part of the 
thermodynamic quantities for an arbitrary imhomogeneous medium 
(Dzyaloshinskii and Pitaevskii (1959)). This enables the theory of 
Lifshitz to be extended to bodies separated by a liquid layer, and also led 
to its application to the study of the properties of liquid films (Dzyalo- 
shinskii Liftshitz, and Pitaevski (1959)). 

We begin the exposition with a short summary of the methods of 
quantum field theory in statistical physics (§2). These methods enable 
us to develop the whole of the theory of van der Waals forces both naturally 
and generally. The subsequent exposition is designed so that the reader 
whose interest lies only in the results of the theory may omit §§2, 3, 4.1. 


§ 2. Tae Metuops or Quantum Frecp THEory IN STATISTICAL 
PHYSICS 


The widespread use of the technique of the Feynman diagrams is charac- 
teristic of modern quantum field theory ; it enables the structure and the 
nature of any approximation to be presented very clearly. 

As is known, physical quantities in quantum field theory are expressed 
through perturbation theory series in powers of the interaction constant 
(for example, in powers of the electronic charge e). Any term of the 
perturbation theory series can be described by the corresponding diagram 
and its calculation on the basis of this diagram is governed by the rules of 
the Feynman technique. In particular, for each internal line of the 
diagram there corresponds a free particle Green function G, or a free 
photon Green function D,, for each intersection of lines (a vertex) on the 
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diagram there is a certain interaction operator (in quantum electro- 
dynamics this is the Dirac matrix y, multiplied by the electronic charge) 
and, finally, an integration is carried out over the four-dimensional 
coordinates of each vertex of the diagram. 

The advantages of the diagrammatic technique show up most clearly in 
the solution of problems in which it is not possible to limit the calculations 
to a finite number of terms of the perturbation series, and it is necessary 
to sum an infinite series of the so-called ‘main diagrams’. The possibility 
of summing an infinite series makes the diagram technique especially 
attractive in quantum statistics, the usual methods of which make it very 
difficult to write down even the first two or three terms of the perturbation 
theory series. 

The application of the methods of quantum field theory to problems in 
statistical physics at finite temperatures is based on the work of Matsubara 
(1955), who showed that the free energy could be calculated by the use of 
the Feynman diagram technique. Each term of the thermodynamic 
perturbation theory series is, as in field theory, described by the corre- 
sponding Feynman diagram and its calculation proceeds along similar 
lines: each line of the diagram represents the ‘temperature’ Green 
function of a free particle @,; the vertexes of the diagrams represent 
the interaction operators. The only difference is that the functions G, in 
Matsubara’s technique depend not on the time ¢ but on the fictitious 
‘imaginary time’ 7 which he introduces ; this varies over a finite range from 
0 to 1/7’, the reciprocal temperature}. Analogously, the time integration 
from — oo to +o at each diagram intersection is replaced by an inte- 
gration between 0 and 1/7’. 

Here we shall review briefly the work of Matsubara. Consider, for 
example, a system of charged particles which interact with the electro- 
magnetic field. The Hamiltonian of this system is of the form: 


A =F. 0 +H int 
where H, is the Hamiltonian of the free particles and photons, and 


depends quadratically on the corresponding field operators »(r) and 
A,(r) in the Schrédinger representation, and Hint is the interaction operator 


ieee | A,(r)j,(r) Pr 


Ja(r) 1s the particle current operator{, which is some quadratic function of 
the particle operators ;(r). 

The thermodynamical properties of the system are determined by the 
statistical matrix, 


p=exp(—H/T), 


t In §§ 2, 3, 4.1, we use a system of units in which h=ce=1; temperature is 
measured in energy units. 

{ Here and henceforth Greek subscripts a, B=0, 1, 2, 3 denote the com- 
ponents of 4-vectors and tensors, while Latin subscripts 7, k=1, 2, 3 are used 
for the components of vectors and tensors in three-dimensional space. 
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and the free energy Ff may now be written as 
F=—TInSpp. 

In field theory the calculation of the average value of any quantity 
(here the average value of p) necessitates the use of the equations of motion 
for the field operators. Matsubara’s basic idea is the transform from time 
¢ to ‘imaginary time’ r, which preserves the formal similarity to the usual 
equations of motion. For this we transform to the appropriate ‘inter- 
action representation’, which is the analogue of the usual quantum- 
mechanical interaction representation, using the formulae 

A,(r, 7) = exp (rHy)A,(r) exp (—7Hy), 
o(r, 7) =exp (rH)(r) exp (—7Hp), 
}(r, 7) =exp (7H,)*(r) exp (—7H)), 
jt, 7) =exp (tHy)j,(r) exp (— 7H), 
Hini(7) = exp (7H) Hintexp (—7H); 
It is clear that 


Aint (7 = a= [ccc T)IAAr, T) d?r. 


Neat: .we introduce the matrix 
p(t) =exp (—7H) 
and write it in the form 
p(t) =exp (— 7TH) (7). 
The matrix (7) is thus defined analogously to the S-matrix in field 
theory. It satisfies the equation 
ae 
20) = Hinx(r) (7), &(0)=1 

which is obtained from the corresponding equation in field theory by 
replacing t by ir. As is known, the solution of this is 


S(0)=T,exp { oa | Alanto art, 


where 7’, is the chronologizing operator, which places the operators 
Hini(r) in chronological order for the ‘times’ r. 
For the statistical matrix p=(1/T') we get the obvious formula 


p=exp (H,|T)® 
S= 9(1/T)=T,exp { Be (in 


His(o) deh, A Saag oe) 


whence for the free energy we have 
F=F,—T \nSp {exp [(F) —y)/T 19}, wrist ene (2ie) 
where F, is the free energy of the non-interacting particles : 
F,=—T mSpexp (—4,/T). 
Equation (2.2) may be written as 
Hae ha PAI (9) inns chee Bie irs) 
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where the symbol <. . . .) denotes a Gibbs average over the states of the 
free particles : 
{...)=Spfexp [F)—A,)/T] . . .}. 

When we expand & in powers of Hint, average each term of the series, 
and finally take its logarithm, we get a thermodynamical perturbation 
theory series for the free energy. The above averaging is equivalent to 
a calculation of the average values of the ordered product of the various 
electromagnetic field and particle operators, for example 

(THA gH, TH)Ag(t a, Ta) ("35 Ta)b(ra, Tat). - = - (2-4) 
Expressions of this type are also met with in quantum field theory. 

The technique of Feynman diagrams is based on the following two 
properties of the equations of field theory: firstly on the possibility of 
representing all the quantities of the theory (the S-matrix, etc.) as 
averages of ordered products (7'-products) of a different number of field 
operators, and secondly on the so-called Wick’s theorem, according to 
which the average of the 7'-product of any number of free particle operators 
may be expressed in terms of the product of all possible averages of these 
operators taken in pairs. These pair averages are the free particle Green 
functions mentioned above. ~ Thus the average of any quantity may be 
expressed in terms of these Green functions. 

Equations (2.1), (2.2) and (2.4) show that the first property is also valid 
in the thermodynamical theory. In this case Wick’s theorem is still 
valid but becomes here, it is true, an assertion exact only as the total 
number of particles V tends to infinity (for a given particle density : more 
accurately it is valid only to terms of order 1/N.) Applying Wick’s theorem 
to an expression of the type of (2.4) we get, for example, 


(T {A(t 71) Ap(Fe, 72)4("3, T3)P(tra, Ta)}) 
= (T {A (ty 71) Ag(te, T2)} XT {h("'s, 73) (Ka, Ta)})> 
(TA{A,(M, 71) Ag (a, 72)4,(1'3, 73)A o(t'a, Ta)}) 
= (T,{A,( r},74)A B (rs, Ta)}>(T, {A,( rs, T3)A,(t4, 74)}) 
+(7T,{A, (1, 71)A,(5, 73)} (1 {Ag (2, T2)A o(Pa, T4)}) 
+ (T{A, (ty, 7) A g(a, Ta)}){T {Ag(ta, T2)A,(¥3, 73)})- 
It is clear that the technique thus developed is completely analogous to 
that of field theory, with the sole difference that the zero order Green 


functions for free particles and photons are replaced by the temperature 
Green functions 


6M — CT (ris 71) Vas Tat poet ee eat ee (2.44) 
for free particles, and 
Bg = — {7 {A, (ri, 71) Ag(t'g, T)}) . . . . . (2,40) 


for photons, while the time integration from — 0 to + 0 is replaced by 


an integration with respect to the ‘imaginary time’ + between the limits 


zero and 1/7’. 
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The Feynman diagrams which describe corrections to the free energy 
take the form of closed loops. For the interaction between particles and 
the electromagnetic field the second-order diagram is given by fig. 1 (a), 
while fig. 1 (6), (c), (d) represent the fourth-order interaction. (Here the 
full line represents the particle Green function and the dashed line that of 
the photon.) We note that for a correction of a certain order of perturbation 
theory we need take into account only the connected diagrams of that 
order (the order of a diagram is clearly the number of vertexes it contains), 
i.e. the diagrams which do not separate into components which are not 
connected at least by one line. For example, fig. 2 need not be taken 
into account in a sixth-order correction. This property is related to the fact 
that the expression for the free energy is of the form In¢....). It can 
be shown that when the logarithm is taken all the unconnected diagrams 
cancel out. 


© OQese 6) 6) 
a b Cie ae eee SO 
a) 
&) 


The perturbation theory series for the free energy has however one 
unfortunate shortcoming. It turns out that the diagrams contribute to 
it with coefficients which vary as 1/n, where n is the order of the diagram. 
(Clearly coefficients of the form a”, where a is a constant, could be dealt 
with, for example, by the formal inclusion of a in the charge.) This 
property of the series for the free energy makes it practically worthless 
for problems where the interaction constant is not small and it is necessary 
to sum an infinite series of diagrams. Fortunately this applies only to the 
diagrams which consist of closed loops, for diagrams containing external 
lines the coefficient does not depend much on the order of the perturbation 
theory. 

The most important of these are those diagrams with two free ends as, for 
example, the diagrams in fig. 3. The sum of all possible diagrams with two 
external photon lines is called the full temperature Green function of the 
photon. It clearly depends on eight variables : spatial coordinates and the 


P.M.S. 2 
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‘time’ 7 of the free ends. It is not difficult to write down an analytical 
expression for it in terms of the operators in the interaction representation. 
In particular (cf. (2.45): 

Bys("y, 713 Vo T2)= — (T{A,(r1, 7)A,(ro, T2)B} »/<®). 
Analogous formulae hold for the particle Green functions. We also give 
a very useful formula, which expresses the full temperature Green function 
of the photon in terms of the operators in the Schrédinger representation : 

Sp {exp [(F — H)/T Jexp [H(71—72)]A,("1) 
exp [—A(7;- T2)]A,(r2)}, sa >T2; 
DBag("i, 713 "a, T2) = — (2.5) 
Sp {exp [(/ — H)/T ]exp [— H(7 —72) Ag (re) 
ke | [H(7,—72)]A,(ry)},  11<Te2- 
The formula for the Green function of a free photon is obtained from this 
by replacing H by Hy and F by F. ; 

It is clear from (2.5) that B,, is a function of the difference 7,—7,=7 
(Bg=B,("1, "23 7)). Fora spatially homogeneous system it will similarly 
be a function of r,—ry. 

The full temperature Green function is simply and conveniently related 
to the free energy and a knowledge of it is sufficient to determine all the 
thermal properties of the system. However, its actual calculation by the ~ 
Matsubara technique is still quite difficult. This is because the success 
of the methods of field theory is largely due to the automation of the 
calculations which is possible because of the expansion of all the quantities 
in Fourier integrals over all coordinates and times. In the Matsubara 
method this is not possible because of the finite permissible range for 
7; @ and D® are discontinuous functions of the variable 7, and all the 
integrals with respect to 7 in fact reduce to integrals over a large number 
of regions, the number of which grows rapidly (~2”) with the order of 
approximation. 

The Matsubara technique can be considerably improved if we make 
use of certain general properties of temperature Green functions 
(Abrikosov et al. 1959, Fradkin 1959). 

As has already been pointed out, the Green function depends only on 
the difference 7,;—7, and as such is defined in the range —1/T to +1/7. 
It is therefore useful to expand it in a Fourier series in r=7,—7,1: 


(7) ae, TS. exp ( oe iE,T) O(E,), 


] —1/7 
G(é,,) = | exp (7€,7)@(r) dz, €,=anT, 


2) ur 
(and analogously for B(7)). 


} The Fourier components should be distinguished from the functions by 
some additional index; this has been omitted in order to simplify the notation. 


General Theory of Van der Waals Forces 173 


The following general property of @ is fundamental in the manipulation 
of the perturbation theory series. It follows from the expression (2.5) 
for B that the photon Green function for negative values of 7 is related to 
B for r>0 by 


B(7r)=B(7+1/T), 7<0. Bi earns aC 2ed) 


This also applies to any Bose particle Green function. For Fermi particles 
we have instead 


G(7)=—G(r+1/T), 7<0. oe OT) 


Equations (2.7 @) and (2.70) follow easily if we note that we may cyclically 
permute the orders of the operators under the trace sign in (2.5) and in the 
analogous formula for Fermi particles. The formulae (2.7a) and (2.7b) 
are of course also valid for free Green functions. 

If we further note than an even number of fermion lines meet at each 
vertex of the Feynman diagram, it is easy to see that all the integrals 


1 
of the form | ... dz in the perturbation theory series may be replaced 


0 
Lue : 
by = | ...dr; after this the transformations are easily made. The 


T 

relations (2.7a@) and (2.76) also mean that the Fourier expansion of boson 
(or photon) Green functions contains only those components with 
‘frequencies’ €,=2n7', while the expansion of fermions contains only 
components with €, =(2n+1)r7T. 

If we carry out the Fourier transforms with respect to the spatial co- 
ordinatest and the ‘time’ 7 in all the terms of the perturbation theory 
series for the Green function (or for the free energy), it is easy to see that 
the technique which arises thus is fully equivalent to the diagram technique 
of quantum field theory in the momentum representation. To each line 
of the diagram there is a corresponding free particle Greén function 
G(p, €,), and for each vertex there is a 6-function which expresses 
the conservation laws }p=0, Dé, =0. The integration and summation 
with respect to all momenta and ‘frequencies’ are carried out for each 
line. Formally the expression for the correction due to a certain diagram 
in the above theory may be obtained from the expression which would 
correspond to this diagram in field theory by replacing 


etd (re A on T Cre. 


The close connection between the above theory and the technique of 
quantum field theory allows us to apply many results of the latter to the 
present theory. As in field theory the temperature Green functions 
satisfy an integral equation of the type of the Dyson equation. 

Consider, for example, the diagrams of various orders for the photon 
Green function. Besides the diagrams in fig. 3, we include those of the 
type of fig. 4 and others which are more complex. 


COU So ee ee 
+ This is, of course, only possible for a body which is spatially homogeneous. 


” 


02 
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All possible diagrams may be represented as shown in fig. 5, where the ~ 
dashed loops denote the sum of all graphs which do not reduce to com- 
ponents joined by only one photon line. This summation of diagrams is, 
of course, possible only because the coefficient for a particular diagram 
does not depend essentially on its order (in the sense used above in the 


discussion of the series for the free energy). 
Thus in order to calculate the total photon Green function it is necessary 
to sum the series shown schematically in fig. 5. This is of the form (for a 


spatially inhomogeneous system) : 
Dyolts. tas &) oe rai Go)+ [Bare tas Ge 
alls, Fas En) (ha, Yo3 En) Urs dry 
fv (ry, 635 E,)U1, (3, a3 Eno O(ta ts 5 E;,) 


TL (t 5, 63 €n)B,e(s, Yo; &,)drsdrydrsdret... 
(2.8) 


Here II,,(r,, r.; €,) 18 the so-called polarization operator of the system, 
which is equal to the sum of the graphs shown in fig. 5 by dashed loops. 


Fig. 4 


V=--—-+ -Q>--— + ---->—-@--- + 
+ -----<QD---Q>---D--- + 


Fig. 6 
— ee eute ame ame + —---<LYD — om 


By re-writing (2.8) as follows 


Bag("1, Wo; €,) =B,(r, re} f4) + | dr, drBD (ry, P33 E,, I 3(ts, Cy Sa 
x {BD 5, (14, Po; ap, ti [ars dr. BD, (r,t 5 ; E,)IT, (5, re; cs) 


XB, (re, o3 En) + | dr, dr,dr, dry .B,,(r,, rs; ex 


% IT, (ts; re3 £7), (re, rs; €,)U1,,(t 7, rg; h) 
as BD 3 (rg, V9; es) als 7 


e 
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it can easily be seen that it is an integral equation for B of the form 
HE Ge Lp) 3 Cia) ne Ors, Ln) ; ca) 


oe [2.0 P35 Sa) lea hss Lar E,,)Bag("a, ro; €,)dr3 dry. 
(2.9) 


The summation process is shown graphically in fig. 6. 

In general it is not possible to write a closed equation for the polarization 
operator, nevertheless the Dyson equation is extremely useful in various 
practical problems, since it is often possible to find approximate equations 
for the polarization operator which enable one to go beyond the frame- 
work of perturbation theory. 

For the long wavelength photons which we shall consider later the 
polarization operator can be expressed in terms of the dielectric permea- 
bility of the body. 


§ 3. THE ENERGY OF a CONDENSED Bopy AssociATED witH Lone 
WAVELENGTH ELECTROMAGNETIC FLUCTUATIONS 


We now turn to the solution of our basic problem—the calculation of 
the increment to the energy of a condensed body due to long wavelength 
fluctuations of the electromagnetic field. To do this we take out of the 
Hamiltonian of the whole system that part which represents the inter- 
action energy of particles with the components of the electromagnetic 
field which have wavelengths much greater than interatomic distances 
(A>a), and we shall treat this a perturbation :+ 


H=Hy+ Hy =Hy~ | Alrite) de. 


The interaction between particles (electrons and nuclei) and the short 
wavelength field is placed in the unperturbed Hamiltonian. This also 
includes the short-range interatomic forces which hold the body in the 
condensed state. The vacuum energy of the long wavelength electro- 
magnetic field is also included in Hp. 

We now calculate the corresponding corrections to the free energy. 
However, as is easily seen, the results of the previous section are not fully 
applicable to the present case. The Matsubara technique depends on 
the above-mentioned Wick theorem, according to which the average 
value of the product of a large number of operators can be expressed as 
the product of various averages by pairs. However, Wick’s theorem is 
valid only if the Gibbs average is taken over states of non-interacting 
particles. Here this condition applies only to the operators of the long 
wavelength electromagnetic field, the averaging of the particle operators 
being over their states in the condensed body, and therefore the average 


+ Mathematically the separation of the long wavelength part means that the 
integral in this equation is in some way cut off for small r. We shall not 
introduce this cut-off explicitly since the answer does not depend on it. 
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values of the products of the operators will not reduce to averages by pairs. 

We now proceed as follows. In the perturbation series for the free 
energy (or for the Green function of the long wavelength photons) the 
particle operators appear only in combinations of the form 


(They, TP, 71)P(a, T2) (Ka, T2)} >; 
CT AB ey, Tr) (ty, 71)P (ra, Ta) (Pe, T2)P (Ks, T3)P(¥3, T3)(1rq, Ta) (Kg, Ta)} ), Cte. 
i.e. the number of operators under the averaging sign is always divisible 
by four, and the operators always appear in pairs of the type #(r, +)/(rr, 7). 
We subtract from the average value of the product of eight operators | 
the quantity 


Crt Bry, 71) (Fy, 7 )P (Pe, T2)ih(y, T2)} ) (TLf(ts, 73) (Ks, T3)h (4, Tq)#(4,7a)}) 
HT {b(ry, 74 )(, 71) P("s, T3)¥6(I'3, T3)} TA ("o, To) P(e, 72) (4, T4)%h(t4,74)} > 


+ (T Abr, Ty )h (ty, 7) (ta, Tr(a, Ta) KT (We, T2)H(Pe, To)f(¥5, T3)h(3, T3)}) 


(i.e. that value which we would have obtained if the averaging had reduced 
only to all possible averages by fours of the above type), and we call this 
difference the irreducible quadrilateral, denoting it by a dashed square. 


Fig. 7 


Next, from the average value of twelve operators we subtract the quantity 
which we would have obtained by reducing it to all possible combinations 
of four and eight operators. The remainder we call the irreducible hexagon 
(the dashed hexagons in figs. 7, 8), ete. 

It is now not difficult to see that the perturbation theory series will be 
described by diagrams of the type shown in fig. 7 (for the free energy) and 
fig. 8 (for the Green function of the long wavelength photons). The 
dashed loop denotes the quantity obtained from the average value of four 
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particle operators. The fact that for it we have used a notation which 
was applied in the previous section to the polarization operator will be 
justified by later results. | 

Physically it is immediately clear that the diagrams which contain 
irreducible quadrilaterals, hexagons, etc. give negligible contributions, 
since they take into account various non-linear processes such as the 
scattering of light by light. This assertion can also be proved in another 
way. Since we have included in AHjy; only the interactions with the long 
wavelength photons, it should follow that all the integrals over the 
momenta of the virtual photons are cut off at some ky which is considerably 
smaller than the reciprocal of the interatomic spacing 1/a. This is evident 
because each long wavelength photon line over which the integration is 
taken introduces a small quantity of the order of kya. The only diagrams 
which do not contain an integration over the photon momenta are figs. 7 (a) 
and. 8 (a) (it should be noted that in the zeroth approximation the Green 
function for the photon depends only on the difference of the coordinates). 


Fig. 8 
—--- —-->---  --p------ 
a 
YAEL ARS 
t \ 
fie (<Q. ‘s ane 
te | : oe 
\ ) i oh Se 
a Aes ere 7 Ee \ 


Thus for the approximation kya <1 only diagrams of the form 7 (a) give 
a correction to the free energy. The corresponding expression for the 
free energy is 


F=F,—3T S {oa("1, Po; E,)BBea(Fe: r,; €,)dr dr, 


i 5 | Mastr 25 En) Boy (Pas ts 5 En) (a> tas Soo (ra, hs En) 

x dr, dr, dr,dr, 

Bre noe = |Haglt ro 5 €,)3Bg,("e, Ws; é,). - TES (Fp oatonts er) 

DB, (tom "13 n) Ory. + Mant... - tie iemmettetaer (a0) 


where II,,(r,,r2; €,) is the quantity denoted in the diagram by a dashed 
loop. Attention is drawn to the coefficients preceding the integrals (1/m 
for the mth term) which give the contribution from various diagrams. 
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When, in the same approximation for the photon Green function, we 
take into account only the contribution from the diagrams of fig. 8 (a),we 
obtain, by the same method as was used in the last section in the deduction 
of the Dyson equation, an equation which is formally identical with (2.9) ; 
however, in our approximation the polarization operator II does not include 
a contribution from the virtual photon lines and is a given function which 
depends only on the properties of the body. 

The fact that we are dealing with long wavelength photons allows us to 
express the polarization operator (and also the photon Green function 
and the free energy of the system) in terms of only the macroscopic para- 
meters of the body. The only quantity which characterizes the inter- 
action between the condensed body and the long wavelength radiation is 
its dielectric permeabilityT. 

We consider an isotropic (but inhomogeneous) dielectric. The Heisen- 
berg operators for the field strengths satisfy the Maxwell equations (¢ is 
ordinary time here) 

rotH = 2 (cE), rob B= — Dy eS ee bt Meee 
In these equations the dielectric permeability is a linear integral operator 
which acts on functions of ¢ and has the form 


2E() SEQ f(t—t’')E(t’) dt’. yn ayeeem (aa) 


It is difficult to deduce directly from eqns. (3.2) the equations for the 
operators which depend on the Matsubara ‘imaginary time’ 7. We 
therefore use a relation between the temperature Green functions and 
those of field theory which was obtained by Abrikosov et al. (1959) (see 
also Landau (1958)). 

It turns out that the temperature Green function for the photon 
DB, (r),"s,€,) is simply related to the retarded Green function of the 
electromagnetic field D,,* which is defined as 


Dyg*(¥y, 25 ty — ty) = 
{ — Spjexp [(F —H)/TI[A (ry, ty) Ag(tr2, te) —Ag('g, ts) Ag(tas t1)]) st > te, 
0 hy <ty. 
(3.4) 
(Here the A,(r,t) are Heisenberg operators). Considerations analogous 
to those of Abrikosov et al. (1959) for a homogeneous body, lead to the 
conclusion that B,,(ry, ry; €,) may be expressed in terms of the Fourier 
component of the function D®, Explicitly, if we define 
Dg™(r1, 0g} 0) = | exp (twt)DR(r,, rg; t) dt 
then for €, > 0 the relation 
Ba("1, 0a; €,)=D*(r,, re; t£,). ths yeients. A aone 


Sn eas Ne a wee eK sd SS 
t+ Henceforth we shall neglect the magnetic properties of the substance as they 
are of no importance in the frequency range considered. | 


General Theory of Van der Waals Forces 179 


is valid. ‘The value of B,, for €, <0 can be obtained from the formula 
for the complex conjugate quantity B_,,*, which follows directly from the 
definition of the temperature Green function (2.5) and the hermiticity of 
the electromagnetic field operators: 


Boel, %o3 En) =Bea* (tet; —En)- fe eet oc) 


The equation for the retarded functiont is now written using the 
operator Maxwell equations. The question of the gauge of the vector 
potential now becomes important. The tensor D,," (or B,,) has ten 
independent components but we still have at our disposal several arbitary 
constants because of gauge invariance. Actually physical significance is 
attached not to the quantities D®,, which contain the vector potential 
components, but only to six corresponding quantities formed from the 
components of the electric field strength. Thus the ten quantities must 
satisfy only six physical conditions, i.e. we have four arbitrary functions 
at our disposal. We may use these to reduce the components Doo” and 
D,* to zero. Such a choice clearly corresponds to a gauge with a zero 
scalar potential. The Heisenberg operators E and H are now related to 
A by the equations 

oA 


E= a? H=rotA, 


while the operator A itself satisfies the condition 

0 

ot 

For the gauge chosen the commutation relations for the operator A are 
of the usual form : 


(<p) +robr0t A=0. i ae haa em Oa 


Ee us Aye’ | = —47id(r—r’)d;,. yt eke (308) 


Using eqn. (3.7), the commutation relations (3.8), and transforming 
to time Fourier components, we get an equation for D,,*: 


fe(r, w)wd, — rot jn LOt my} Dy (", r’ ; w) = 478(F — FY di, ae eh 89) 


wm 


Here the symbol rot, denotes the operator 
C74 0/02, 


where e,,, is the alternating tensor. Replacing w by %€,, in (3.9), we find 
that the function B,,(r,r’; €,) satisfies the equations 


{e(r, Ve nlon, On oF LOtinn LOtyDixll; r’ ; -)) Se. 473(r ‘a r')8i7, 2 (3. 10) 


forsee = 0: 
The dielectric permeability for an imaginary frequency which appears 
here is related by a simple expression to the imaginary part of the dielectric 


+ We emphasize that the choice of retarded functions is important, since the 
operator ¢ relates the value of the field strength at a given moment to its 
previous values. 
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permeability for real frequencies (see, for example, Landau and Lifshitz, 
1960, $58): 
2 ff” we'(w) 
(evel tes - dev. Pet oe 2X 8, 
e(igalt— | Spade 
Since «”>0 it follows that ¢(i&,) is a real, positive, and monotonically 
decreasing function of €,,. 
Because (ié,) is real so is the Green function B;, for €,>0. For €, <0 
its value is determined by the relation (see (3.6)) : 
Bji(t1, 02,5 En)=Byilta "13 —Fn) -« « | (3.12) 
By using the Dyson eqn. (2.9), it is not difficult to show that the polari- 
zation operator also satisfies the same relation : 
H(t, Vo; ca) a IT ,.;(t 2, ey ay! on) a > a ae (3.13) 
We can now express the polarization operator of the system in terms of 
e(ié,,). To do this we operate on eqn. (2.9) (for our choice of gauge the 
components B,, with «= 0 or 8 =0 are zero) from the left using the operator 
€ din + LOty POty,.. 
Since B satisfies (3.10) and B® satisfies the same equation with ¢(7é,,)=1, 
we get 


[ mic rot; €,)By(t', te; €,) dr’ = 


whence we see that for €, >0 


€(ry, 2) rh? 1 


4 £7 Bii(01, Po; fart £ 
TT 


? n =f 
Tix(ta 023 fn) = iene €,,78;,0("y — Fe). 


Defining from (3.13) the polarization operator for €,<0, we finally find 
that for all €, : 


 i{é,))—1 
TT .(ry, Po; ER) ee ae E78:.5(0y x r,). 7 (3.14) 


The fact that the polarization operator is proportional to 8(r,—r,) is related 
to the neglect of the effects of spatial correlation in the macroscopic theory. 
These effects are important in metals (particularly in superconductors) 
at the frequencies of the anomalous skin effect. However we shall be 
interested in higher frequencies (infra-red and above) in which range 
there is no spatial dispersion. 

Having expressed the polarization operator in terms of the dielectric 
permeability of the body, we could, in principle, calculate the correspon- 
ding correction to the free energy from eqn. (3.1). (The Green function for 
the free photon can be found directly from the definition (2.46) or by solving 
eqn. (3.9) with e=1.) However, as we have already pointed out, the series 
(3.1) cannot be summed directly. Instead we determine the additional 
pressure (or more accurately an additional stress tensor) which arises as a 
result of the interaction with the long wavelength fluctuation field. 

To do this we imagine that the body is subjected to a certain small dis- 
placement described by a vector u(r). The resultant change in free 
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energy of is —jfudV, where f is the force on unit volume of the body due 


to the deformation. The corresponding change in the unperturbed energy 
bF'y is 

| ugrad p,dV 
where ,(p, 7) is the uncorrected pressure for a given density p and 
temperature 7’. For a given displacement the only change in the series 


for the correction is in the polarization operator because only this depends 
on the properties of the medium. We have 


811 ,,(t, Po ep) ra (1/4ar)E,,76;,8(ry i ry)de(ry, ilé,|)- 
Variation of the series (3.1) gives 


SP=BFy— Seg? { dr. Belts iféu))OBiOC sf) 


TT n= — co 


[Bw V1; EU (4, Po; E,) BO", r; G.) dry dr, 


Ll | B;,O(r, ry; EU ja(t1, Po; En) Bim (¥e, V3; 7) lige (has V4; con) 
XBpO(r,r; €,) dr dr.drzdr,+.. .}. 


The series in curly brackets is that for the photon Green function which 
corresponds to the diagrams of fig. 8(a). Therefore we have 


8F = 5F)— — s Fell B,(r,r; €,)8e(r, i|é,[ar. 


TT y= — 0 


By using (3.12) we finally obtain 


6F =6F,— = s” C77) | Dr, rs 6, oh, tl,)a0. f°. (3.15) 


n=0 


The dash on the summation sign denotes that the term with n=0 is given 
half weight., We note that €, =2znT’. 
The variation of « is related to the displacement u by the equation 


Se= —ugrad e~pSdivu. ee eta ie al) 
p 


Substituting this in (3.15) and integrating by parts we obtain an expression 
for the force acting on unit volume of the body 


f= —gradpy— 7 3’ f,°Mi(rsrs &,) grad e 


ue = >) f° grad {Bir F; Sn)P st. sates is WG) 
4m n=0 Op 
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From this formula it is easy to calculate the correction to the chemical 
potential of the body. First we note that f=0 in mechanical equilibrium 
and that for a given temperature the relations 


grad ep, 1)=Sgradp, pulp, 2)=plie, 7). (3:18) 


are valid (where (,(p, 7’) is the unperturbed chemical natedtial per unit 
mass of the body), then we equate (3.17) to zero and a simple transformation 
gives 


peradl=0, 
where 
t: 
C(p, T)= Lo(p; fy neg vi -> E, Bi( rir; Ese z - (3.19) 
T n=0 


As is known, the equilibrium condition for any LA att body is 
that the chemical potential should be constant over it; it is therefore 
clear that (3.19) defines this potential (per unit mass). 

We now turn to the calculation of the pressure. We need to reduce the 
expression (3.17) for the force per unit volume of the body to the form 


fm 
Lhe 
where o;, is the strain tensor. The resultant calculations are almost 
identical with those carried out in electrodynamics to find the Maxwell 
stress tensor (see, for example, Landau and Lifshitz 1960, §15), but we 
shall discuss them briefly here. 
First we introduce, besides the photon Green function B;,, two related 
functions 


(3.20) 


B;,"(r, r’ ; aay os mit AU r ; ae 
diet (3.21) 


B,"(r, ire ; ea) ae roti Lot pm B,,,(r, 


which are made up from the electric and magnetic field operators in the 
same way that B;, is made up from the operators of the vector potential. 
We now rewrite (3.17) in the new notationt 


Op f LE: 0 
fim — Fe + FS ae [OBC pele, Oh 
ail : Fi) 
— 7m (r) 5 Ban ‘(r, Fl eV CR) Ca ce heen Ps (3.22) 


We now transform just the last term in (3.22); taking out the summation 
and the factor 7'/47, we rewrite it in the form 


i : ; 
e(r ) > Dicks (Vr, r)+e(r 5 ; 


with the intention of putting r=r’ at the end of the calculations. 


B;,," We ra 


$m SSS 
| To be concise, we shall omit the arguments é,, and 7€,, in the intermediate 
formulae. 
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Carrying out further obvious transformations we get 


» 2 te ee 
2 eB Ee) (BP 0) — = ele M0, F) 


Swine ae 
Oxy, 


) ; 0 
Fete] sD Bw] gay 


1 0 a ra) . 
a en A ser jn Baw (e r ea. ies ), 


0 yes 0 j 
<(r) E Bj" (r, a Wee ie r | = 
o 


ane one | 
ened Lf )+ aaeee (Mo r GET ao ea ). 


He 
Substituting these in (3.21) and putting r=r’ gives 
g E 
stat == 9 
<(r) One (r,r) Ox, 
Finally substitution in (3.22) shows that the force can be expressed in 
the form (3.20) with a stress tensor 


e(r)B;,,"(r,r) +2 ow, Hr, r= 2 BM, r). 
a 


eed { ~ 48, Ee ié,) —p BP (rr; &,) 
ATT n=0 dp é 

ai e(r, 1E,) Bin” (r, r; Sr) e 30; (¥, r; eal +B (r, Yr; En) (3.24) 
However, this formula still does not have a direct physical meaning, since 
the quantities B,,“(r,r’) and B,,“(r,r’) both tend to infinity for r=r’. 
This is related to the fact that, unless a corresponding cut-off is introduced, 
fluctuations with small wavelengths make an infinitely large contribution 
to o;,. These have no relation to the inhomogeneity of the body in the 
sense that their contribution is the same in both homogeneous and in- 
homogeneous bodies which have the same value of ¢ at the point considered. 
The contribution of the long wavelength fluctuations to the stress tensor 
in an inhomogeneous medium, which is actually independent of the nature 
of the cut-off, is obtained by the corresponding subtraction in (3.24). 
The Green function B,,¥(r,r’) (and analogously for B,)”) in this formula 
must be interpreted as the limit of the difference 

lim [B,,2(r, r’) -—B,.7(r, r’)] 


ror 
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where B,,” is the Green function of an homogeneous infinite medium 
whose dielectric permeability coincides with that for the inhomogeneous 
medium at the point where the stress tensor is being calculated. To 
avoid excessively unwieldy formulae we shall in future write (3.24) in its 
previous form with the assumption that the above subtraction has 
already been carried out. Here po(p, 7’) is the pressure in an infinite homo- 
geneous medium for given values of p and 7’. 
The same remarks also apply to formula (3.19) for the chemical potential, 
which by using (3.21) can be written in the form - 
ml © 5 
Up, T)= Llp, T) + — >! A ely aris 2). +) (828) 
TT n=Q P = 

We note that the category of inhomogeneous media includes systems 
which consist of several bodies, each of which is homogeneous. In this 
case the components B,;, in the solution of the eqns. (3.10) must satisfy 
certain conditions at the boundaries between the bodies. We recall that 
in (3.10) the coordinates r are independent variables, while the co- 

-ordinates r’ are merely parameters. Hence we discuss the boundary 
conditions for the variables r. These conditions correspond to the 
continuity of the tangential components of the electric and magnetic 
fields. Since one of the indices (7) of the tensor B;, refers to the point r 
(in the sense of the definitions (3.4)), the tangential components of the 
tensors B,,,” and B,,,“ must be continuous with respect to this index. 

In form eqn. (3.24) is similar to the usual formula for the Maxwell 
stresses in an electromagnetic field, but the quadratic combinations of 
components of the electric and magnetic fields are replaced by the corre- 
sponding functions DB,” and B,,“. Too deep a significance should not, 
however, be attached to this analogy. The fact is that there are good 
grounds for the supposition that the concept of the stress tensor for an 
alternating electromagnetic field in a conducting medium is not meaning- 
ful. But in (3.24) we are dealing not with an arbitrary electromagnetic field 
but with the proper fluctuation field which is in thermodynamical equili- 
brium with the medium. 

Equations (3.24) and (3.25), obtained by Dzyaloshinskii and Pitaevskii 
(1959), solve in principle the problem of the calculation of the van der 
Waals part of the thermodynamic parameters of a body, by reducing it 
in each real case to the solution of the eqns. (3.10) for the Green function 
D;;.. 


§ 4. MotecuLaR INTERACTION FORCES BETWEEN SOLID BoprEs 
4.1. Derivation of General Formulae 


We use the general theory developed above to calculate the van der 
Waals forces acting between solid bodies whose surfaces are separated by 
very small distances. The gap separating the bodies may be considered 
to be filled with any liquid. In what follows we shall denote the two 
solid bodies by the indices 1 and 2, while the medium which fills the gap 
will be denoted by the index 3. 
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Although we shall represent the gap by two parallel planes, it should 
be noted that for a rigorous statement of the problem it would be neces- 
sary to consider at least one of the bodies as being of finite size and as 
surrounded by the medium 3 on all sides, the total forces on the body then 
being calculated. However, since molecular forces die away very quickly 
with distance this resultant force is in fact completely equivalent to the 
forces acting across the narrow gap which separates the two bodies. 

The total force acting on the body 2 can be calculated as the total 
momentum flux which enters the body from the surrounding medium 3, 
1.e. as an integral fo,,df, over the surface surrounding it. For this it 
should be noted that the medium 3 is in thermodynamical equilibrium, 
one of the conditions of which is that its chemical potential should be 
constant: €=const, where ¢ is given by eqn. (3.25). Since the long 
wavelength fluctuations give only small corrections to the density, we can 
‘take the density p to be constant throughout medium 8, and the change in 
the chemical potential ¢(p,7') equals the change in the quantity 
po(p; Z')/p (because (3.18) applies). Therefore the condition ¢=const. 
can be written as: 


SPS To) ; 
pop, L') + Fy PG Ba (rs #) = const. ee seat (4) 


As aconsequence of this condition part of the total stress tensor is a constant 
equilibrium pressure throughout the liquid, and this does not contribute to 
the total force acting on the body. Neglecting this constant part, i.e. 
subtracting from o;, the left-hand side of (4.1) multiplied by $,,, we find 
that to determine the force it is enough to write the stress tensor in medium 
3 in the form 


O; 


Five ota = > fes[Ba"(r, vr) — $5;,Bu8 (nr, 0)] + By" (0, ©) — 28;,38 74 (r, v)}- 
(4.2) 


We take the x-axis perpendicular to the plane of the gap, whose breadth 
is denoted by I (so the surfaces of the bodies are given by the planes x=0 
and «=1). We can now write the force per unit area of body 2 as 


Pl) = oz (1) = oe 5 {eg Byy2(l05 En) + Beclls 1; En) — Bae (bts En)] 


Ait n=0 
+B, 7 (1, l > En) +B," (1, l ; es) —B,," (1, l > E,)} 3 . . (4.3) 
where a positive force corresponds to an attraction and a negative to 


repulsion. 
Because the problem is homogeneous in y and z the Green function 
B,,(r, r’) is a function only of y—y’ and z—z'. We make a Fourier trans- 


form with respect to these variables 
Byle,2's a3 Ex)= | { exp [—iayly-y')—iaee—2) 
x Biylr, 05 En) dy—y') dz—z’) 
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and take the y-axis along the vector q. The eqns. (3.10) for the Green 
functions now become 


da? 


2 S, l / , 
(weg 5) Bul gw) +i — Boyle, x )= —4rd(x—2’), 
2 ue we stadt 
wD (Xe ) + 19 By, x )=0, 


WD (2, oe) = ig Ble 2’) Wie 4n3(x - x), 


: d? ro ae * 
(wa <) B p(x, x )+1q  Bralt,® )=0, 
where w=(cé,?+q?)"?, and a’ is a parameter (the components B,, and 
B,, of the Green function are zero because the equations for them are 
homogeneous). 


The solution of this system reduces to the solution of two equations 


2 
(w* _ =) BD, (x, xv’) = —4rd(x4—2’), 


dx 
d? 4rw am, 
(w* bs 5) Brule a’) = — e2 d(x —2’), 

after which B,,, and B,,, are calculated as 

D,,(0,0')=- 42 B,,(v,2') 
: 4.5 
By .(%, x) = — ad z —B,,(x, 2) = #T 5(¢— 2"), | ve 

w dx w 


The boundary conditions which correspond to the continuity of the 
_ tangential components of the electric and magnetic fields reduce to the 


requirement that the quantities B,,.”, B,,", B,,”, B., should be con- 
tinuous, or equivalently that 


; Bi, Bex; rot,B);,, Tot, By, 
should be continuous. 
Using (4.5), we find that 


D,, -B.,,B eae 


2) 7 juve rt ake es eee 


must be continuous at the interface. 

Since we are interested in the Green function only in the gap we can 
straightaway limit discussion to 0<w'</. In the range 0<a<I the 
functions @D,, and @D,, are given by eqns. (4.4) with e=6,, 
W= Ws =(ef,"+¢9")'". In the regions 1 (w<0) and 2 («>1) they satisfy 
the same equations without the right-hand sides’ (because here 242°’ 
always) with «, w replaced by €,, w, and ey, wy respectively. 
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The subtraction mentioned at the end of § 3 means here that from all the 
functions B,;, in the region of the gap we must subtract their values for 
€y=€2=€3, W,=W2z=W 3. Hence, in particular, we can drop the term 
containing the 6-functions in the second of the equations (4.5), so that in 
the gap the functions B,,, B,, are given by the equations 

Dy = — —— "B= — 5 (4.7) 

Before we solve these equations we make one further remark. The 
general solution of the eqn. (4.4) is of the form f+(~—2a')+f-(+a’). By 
using (4.4), (4.7) and the definition of the functions B,” and B,,” it can , 
be shown that the parts of the Green functions which depend on the sum 
-2+a’ make no contribution to the expression (4.3) for the force 7. We 
shall not discuss this further here since it is already obvious from physical 
considerations : if we put x=w’ in a solution of the form f-(x + x’) we would 
obtain a momentum flux in the gap which varied with the coordinates, 
and this would contradict its conservation. Therefore, below we shall 
give only the expression for the part of the Green function which depends 
on 2-2’. 

We now turn to the determination of the function B,,. This satisfies 
the equations 


2 
(vs? =) B,,(x, oe) = — 4718(% — 22’), 0<a<l, 
v 
d? ; 
(2 <) I) (aia) = 0, | x<O0, 
2 d? , l 
(. ~ 5) Belt 2") =0, Us 
Hence we find 


B,,—Aexp (w,2), as 
B,,= Bexp(—w,2), x >l, 
2 / 
D,,.=C exp (w3r) + C, exp (— wgr) — — exp (—w,|x—2'|), O<a<l, 
3 
Determining the constants A, B, C,, C, from the boundary conditions 
that B,, and B,,/dx are continuous, we find that B,,* 
4a 


2ar z 
ice ch w,(% — 2’) — —exp (—w|%—2x'|), O<7 <7 
D., Wad 3( ) We p ( al | 


(wy + Ws)(We + Ws) . 
eae renp (ily ene nae) 
ae (wy — Ws)(W2 — Ws) 
Subtracting the value of B,,+ for w,=w,=w; (when 1/A becomes zero), we 
finally obtain 


B,.t= <7 ch w(u—2'). > eee Steet aed) 
3 j 


P.M.S. ig 
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Analogously, solution for B,,, gives (after subtraction) 


47wWs 


Bogs Fe Rae ae) Ope ing SY 
n “3 
A= 1—exp (21ng)) 178 t Saas Se aa 


(€1W3 — €gty)(€gW3 — €gtg) 


and, by using (4.7) 


Dey = > an = sh w(x —2’), 
n €3 
Be! =— sil ch w,(% — 2’) ee 
ae E,7€gW3 A ‘ ea 


If we now calculate the quantities B;,(v, x’; q; €,) and B,,"(x,x; q; €,) . 
and substitute them in ee (4.3), we get: 


F)=- + dq.w 
mmf, 24 a+ 3): 


Transforming to a new variable of integration p, where q = €,,[(€,(p?— 1) }!” 
and returning to a normal system of units, we obtain the final expression 
for the force F acting on unit area of each of the two bodies (media 1 and 
2) separated by a gap of width / which is occupied by medium 3 (fig. 9) 


> tf (it Ps%tp) (En “1 
ant et | a alkaean 33 1— P)(S2—P) exp (“E8ive) -1] 


RENE ic, 2PEn 
ire — pey/€s)(s. 9 — Péq/€x) xp (=H ly ‘) “2 hap (4.13) 


where 8) = 1/(«/€3—1+p*), 8.=4/(€o/eg—1+p?), &,=20nkT/h; €, €, € 
are a ea of the imaginary frequeer w=tf, («=€(vé,)); k is Boltz- 
mann’s constant. The summation is taken over integral values of n, 
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and the dash attached to the summation sign denotes that the term with 
n=0 is given half weight. Positive values of F correspond to attraction 
between the bodies and negative values to repulsion. 

This formula (for «= 1, ie. for bodies separated by an empty gap) was 
first derived by Lifshitz (1954) by another method which did not use the 
techniques of quantum field theory. These techniques are however 
necessary for the generalization of the result to a gap filled by an arbitrary 
medium (Dzyaloshinskii e¢ al. 1959). 


4.2. Discussion of General Formulae and Limiting Casest 


The general formula (4.13) contains the functions «(w)—the dielectric 
permeabilities as a function of field frequency—for both solid bodies (e€, and 
e,) and for the fluid medium which fills the space between them (e,). We 
note that «(w) is a complex quantity [e=«’(w) +7e"(w) |, and that its imagin- 
ary part is always positive and determines the dissipation of energy in an 
electromagnetic wave propagated in the medium. The function ¢(w) is 
related to the refractive index n and the absorption coefficient « by the well- 
known expression 1/e=n+«. It is known that a formal consideration 
of e(w) as a function of the complex variable w leads to certain integral 
relations between ¢’(w) and e«”(w)—the Kramers—Kronig relations. The 
expression (3.11), which determines the values of the function « for a 
purely imaginary argument w=7é from the values of the function ¢"(w) 
for real w is a particular case of these relations; «(i€) is a real quantity - 
which decreases monotonically from e€, (the electrostatic dielectric constant) 
for €=0 to 1 for €> «. It is these functions ¢«(ié) which appear in the 
expression (4.13). We can therefore state that the only macroscopic 
properties of bodies that determine the strength of the molecular inter- 
action between them are the imaginary parts of their dielectric permea- 
bilitiest. 

Before we proceed with the discussion of the formula derived it is 
necessary to make the following general remark. If two bodies are 
separated by an empty gap, the electromagnetic forces which we have 
calculated are the only interaction forces between the bodies, but if the 
gap is occupied by some medium then we have the possibility of fluctu- 
ations in the medium which are. due to non-electromagnetic oscillations 
(sound, for example) and these can contribute to the interaction. How- 
ever, it will be shown in §5.2 that the contribution of these nonelectro- 
magnetic forces is usually small. 


+ Most of the results given in §§ 4.2, 4.3 and 4.4 are taken from Lifshitz 
1955). 
iedeues (4.13) was derived on the assumption that all the media were 
isotropic and therefore its applicability to crystals depends on the possibility 
of neglecting the anisotropy of the dielectric permeability. Although this is 
admissible in the majority of cases, it should be remembered that in general the 
anisotropy of the bodies also leads to a specific effect—the appearance of a 
couple which tends to rotate the bodies with respect to one another. 


P2 
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If the bodies are identical (€, = €,) then the expression under the integral 
is always positivet for all the terms of the sum in (4.13), and for given 
values of p and &, it decreases monotonically as / increases. Hence it 
follows that F >0 and dF/dl <0, i.e. identical bodies attract one another 
irrespective of the size of the gap between them, while the attractive force 
decreases monotonically as the gap:increases]. 

This statement is also valid for two different media which are separated 
by an empty gap (e,=1). If the bodies differ and the space between them 
is filled by some fluid, then the force between them can be either attractive 
or repulsive (see below). 

The general formula (4.13) is very complicated but it can be considerably 
simplified if we use the fact that the influence of temperature on the 
interaction force usually turns out to be negligible. 

Because of the presence of an exponential under the integral in (4.13) 
the important terms in the sum will be those for which €, ~¢// or n~ ch/lkT. 
For [kT/ch>1 large values of n will thus be important and we can 
replace the sum by an integration over dn=(h/27kT)dé. The tem- 
perature now drops out of the formula and we are left with the following 
result : 


F= sas | z | ” préte,a | eee exp (7H tv) = 1] ry 
0 1 


Ines (5: — p)(82—P) 
+ [re eae £1/¢s)(62 + Peels) oxy (-E lve) 2 | = dpdé. (4.14) 
(81 — Peq/€3)(S2 — Pea/ €3) c 
This formula is valid for separations |<ch/kT ; at room temperature this 
gives distances of up to ~10-4cem. 

Equation (4.14) is still complex but it can be simplified further in two 
important limiting cases. : 

First we consider the limiting case of ‘small’ distances, by which we 
mean distances which are small compared to the wavelengths A, which 
characterise the absorption spectra of the given bodies. The temper- 
atures which are applicable for condensed bodies are certainly small 
compared to the iw which are important here (for example, in the 
visible region), and therefore the inequality k7l/hc<1 is known to. be 
satisfied. 

Because of the exponential factor exp (2pél/ €,/c) in the denominators of 
the expression under the integral, those values of p for which pél/c ~ 1 are 
dominant in the integration with respect to dp. In this case p>1 and 
therefore we can put 8,28, p in the main terms. In this approximation 
the first term in curly brackets in (4.14) is zero, while the second term 


} This is easily seen if we note that for s=/(e—1+p2) (where p>1) the 
inequality ep>s>p holds for «>1, while for e<1 ep<s<p. 

{ This assertion was made earlier by Hamaker (1937) on the basis of the 
assumption (which is in fact not valid) that the molecular forces should be 
additive. 

§ When we talk of the influence of temperature, we exclude the temperature 
dependence related to the dependence of the dielectric permeability itself on 
temperature. 
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gives on the introduction of the variable of integration x = 2lp£e,1/2/c 


€, + €3)(€9 + Es) e - oe 
ua . 
Page [oe o eespr ees (€—- €3)(€9 — €3) % | Be iS Es a 


(in this approximation the lower limit of integration with respect to dx 
is replaced by zero). 

This force is proportional to the inverse cube of the distance, as would 
be predicted by the usual theory of van der Waals forces between two 
atoms. The functions «(i€)—1 decrease monotonically as € increases 
and tend to zero. Therefore values of € which lie beyond a certain €, make 
a negligible contribution to the integral; the condition that 1 should be 
small is that 1<c/€. 

To estimate the accuracy of the limiting behaviour derived above it is 
useful to have the next term of the expansion of the function F(J). 
Calculation from the general formula (4.14) gives (for similar bodies 
separated by a vacuum, 2 €)=€, €;=1) the expression 


E IP pLeeatlege © se see’ ee (4g) 


= Ra ey 
which must be added to (4.15). However, it is not possible to make a 
realistic estimate of the region of validity of the limiting law without 
knowing the form of the function «(#). 

To an accuracy which is quite sufficient in practice eqn. (4.15) can be 
written in an even simpler form by neglecting unity compared to the 
term with e” in the square brackets. (['The accuracy of this reduction is 
connected to the fact that an integral of the form 


tae glee er (4.17) 
nm! J 9 aew—1 
varies very little as a varies from oo to 1: from | to 1-2 for n=2, to 1-08 


for n=3, to 1-04 for n=4, etc.] Then the integration with respect to da 
can be earried out in an elementary way, and instead of (4.15) we get 
Tes ae calif) ge 4.18) 
| 873” o (€x(2E) + €3(r€))(€n(E) + €3(7€)) 

The quantity |@| is some characteristic frequency for the absorption 
spectra of all three media. 

We now turn to the opposite limiting case, that of ‘large’ distances ; 
1>A,. We shall however suppose that the distances are not so large as 
to invalidate the inequality [kT /hce <1. 

We introduce into the general formula (4.14) a new variable of inte- 
gration «=2plg€/c, but as the second variable we take not € (as above) 


but p: 
ee 8 off (81 + P)(S2 +P) exp (eVe)—1| =. 
ae —p)(s.—p) 4 
2 a cee Bey y Sex te pe todk 
aot A exp (Xe) — 1 dp da 
f (81 — Pey/€3)(S2— Peo/€3) : } . 
e=e(ixc/2pl),. s=/(e—14+ p*). 
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Because of the presence of the factor exp (x/¢3) in the denominators, 

values of «~1/,/e,<1 are the only ones important in the integration 
over dx, and since p> 1 the argument of the function ¢ for large / is nearly. 
zero over the whole of the important range of the variables. Because of 
this we can replace €,, €9, €; by their values for §=0, i.e. the electrostatic 
dielectric constants. If we then replace 2/€3) by x, we finally obtain the 
following result 


3he mee]. | seb 4 [eee So + P)(S20+P) ox =a oF 
~ 327 *l4/ €59 1 P? (L(S10 — P) (S20 — Dp) 
4 (S10 + Perl €s0) 829 + P€20/ €30) meee | | pax _ (4.19) 
(S19 — P€10/ €30)(820 — P €20/ €a0) 
S19 = V(€r0/es0— 1+ Pp), 820 = (€20/€s0 — 1 +p") 
where €19, €29, €39 are the electrostatic dielectric constant. 
In accordance with the above-mentioned property of integrals of the 


type of (4.17) eqn. (4.19) can be written with considerable accuracy in 
the form: 


3he (es Se 4 (810 — P€10/ €30) (820 — roma ee 
1677l4V/€39 J 1 S10 + P)(S20 + P) oF + P€19/€30)(820 + P€20/ €30) 

(4.20) 
Here only one integration remains and in principle this can be reduced to 
elementary functions; the result is however so unwieldy that in real 
calculations it is better to use numerical integration. 

It was pointed out above that if the two bodies differ and the medium 
between them is filled with fluid the interaction can be either an attraction 
or arepulsion. Thus, from (4.18) it is clear that if ¢,—«, and e,—e, have 
opposite signs in the important frequency range, then F <0, i.e. the bodies 
will repel each other for ‘small’ separations. At ‘large’ separations the 
forces are determined by the electrostatic values of the dielectric permea- 
bility ; if €y9 — €g9 and €y9— €g) have the same sign F’ > 0, while if their signs 
differ F <0. Furthermore, since the relative magnitudes of €19, €a9, €g9 are 
not in general related to the behaviour of the functions €,(7&), €,(7&), €3(¢&) 
in the frequency range which is important for these bodies, it is posssible 
in principle to have cases in which F' changes sign at some value of J. 

We now return to eqn. (4.19) and consider some of its special cases. In 
particular a simple result is obtained when both bodies are metals. For 
metals the function ¢(i€) tends to infinity as €>0; therefore we must put 
€9= 0. If we put ve E99 = 00, we get 


1 2 270 ar 2 
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This force is independent of the nature of Re metal (a property that does 
not hold for small distances, where the magnitude of the interaction depends 
on the behaviour of the functions ¢(ig) for all values of € and not just for 
€=0). For €3)=1 formula (4.21) coincides with that obtained by Casimir 
(1948) for this special case by considering the characteristic vibrations of 
the field in a gap between two walls which reflect ideally at all frequencies. 
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If both bodies are the same (€4) = €g9), (4.19) can be written in the form 


Deo Rewer e156 — ean \ 7 € 

SHOE Feaeatn) (ee) 7 OP 
where ¢,,(x) is a function whose numerical values are given in fig. 10 
(curve dd) for an argument ranging from 1 to 00; in addition we note that 
Paa(9)=0-52.. For > « ¢,, tends to 1 as 

paa(t) = 1— eas ; 

for x— 1 it tends]to a finite limit 0-35 (according to the limiting law (4.35) ; 
see below). 


Fig. 10 
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The same diagram shows the curve (dm) for the analogous function 
describing the attraction between the dielectric and a metal (€g)= 00) by 
the following formula: 


gw hE LD <€49=€59 E16 
iA 240 UF ¥/€39 €10 + €0 Pam (e) , RE Cae 
For «,— © the expression (4.19) tends to zero. This means that when 
the gap is filled with liquid metal the interaction force at ‘large’ distances 
drops off as a higher power of 1//. This peculiar case is in principle of 
some interest although it is hardly of practical significance. To study 
it we must return to the initial expression (4.14) and in it take into account 
the real law according to which the dielectric permeability of the metal 
increases with a reduction in frequency. 
For in a metal in the infra-red the function «(w) is given with sufficient 
accuracy by the formula 
el) = — 4rre2@N im? 2 ce 8. a ee (4.24) 
where WV is the density of the number of free electrons. When 
e,(i¢) =47e? N/m is substituted in (4.14) the exponential factors in the 
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denominators of the expression under the integral become 


2pl Are? 
eR (2 J ( mN )} ad 


ie. € drops out of the expression, while since p covers a range where p> 1 
we conclude that the region of frequencies considered gives a contribution 
to the force F which decreases exponentially as the separation / increases. 

In this case the main contribution to the interaction force is given by 
still smaller frequencies, where ¢«(w) is related to the usual electrical 
conductivity of the metal o by the expression 


e(w) = 4710 /w so eee 


We substitute ¢,(i€) = 470,/€ in the expression (4.14) (in the exponent and 
in the factor ¢,%/2; otherwise it is sufficient to put «,= 00) and change the 
variable of integration from € to 2=4pl/(703€)/c, and we finally obtain 


Sega he (te dl rc l 
P= Ed) Ee ieee pase 
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The calculations of the double integral (using the properties noted above 
of integrals of the type of (4.17)) gives the value 13-5, and as a result we 
have the following formula : 

F=0-0034hc2/o®. . . 2 2 2 2. (4.26) 
Thus, when two bodies are separated by a layer of liquid metal, the decrease 
in the molecular interaction as /-* for ‘small’ distances becomes a decrease 
as 1-5 for ‘large’ distances; though the onset of the latter is delayed by 
the presence of the small numerical coefficient. 

It is natural to ask the real value of the wavelength A, with which the 
distance / must be compared. However, the answer cannot be given in 
general as it depends on the real form of the spectral distribution of 
absorption by the given bodies (i.e. on the particular properties of the 
functions e”(w)). As an example we shall calculate the range of applic- 
ability of equation (4.21) for the interaction of two metals (which we shall 
take to be identical) separated by a vacuum. 

Equation (4.21) is obtained from (4.14) by putting «,=«,= 0 (€,=1). 
If we wish to obtain the next term of the expansion as well, we must use 
the form (4.24) of the function «(w), as this is valid over the frequency 
range which is important in the integration. (The region of still smaller 
frequencies, where e(w) is given by (4.25), gives a very small contribution 
to the integral considered). After substituting «(ié) in (4.14) we replace 
€ by xe/2pl; an expansion of the expression under the integral in powers 
of 1/l gives 


7 he Qt “oe m pw wtetde [© 24+] 
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As a concrete example N = 5-9 x 102 for silver, and we find that the second 
term is small compared to the first provided 10-6 10-4cm. We note 
that the next term of the expansion derived here could not have been 
obtained by the method which was used by Casimir to derive the first 
term. 

The scope of this article does not include a review of the experimental 
data on van der Waals forces. We merely note that the first reliable 
measurements of the molecular attraction between solid bodies (on quartz)+ 
were made by Deryagin and Abrikosova (1956) and Abrikosova (1957) and 
were in good agreement with theory. A detailed exposition and discussion 
of these data is given in the review articles by Deryagin et al. (1956, 1958). 
Similar measurements were also made by Kitchener and Prosser (1957) 
and de Jongh (1958). 


4.3. The Influence of Temperature 


All the formulae quoted in §4.2 are obtained on the assumption that 
kTl/he <1, in accordance with which we retained only the first (zeroth) 
term of the expansion in powers of the temperature when we transformed 
from (4.13) to (4.14). To estimate the error which this introduced we 
need to find the next term of the expansion. We now do this for two 
identical metals separated by a vacuum. 

The replacement of a sum by an integral in the derivation of (4.14) 
corresponds to the use of the first term of Euler’s well known summation 
formula: 

—y se I , 1 Ml 
Y fin= | Ferdnt SF 0)~ apg hk O)+ 

Here the role of the function f(n) is played by the integral under the 
summation sign in (4.13). In the calculation we assume that / is small 
compared to fic/kT’, but still large compared to (c/e)\/(m/N), the charac- 
teristic quantity for the metal (see (4.27)). Then f’(0)=0, f”"(0)=2 and 


thus 
7 he 48 /IkT\ 4 
Ss ee 2 ALY NS eet: 4.28 
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+ The case of quartz has certain peculiarities because of the specific properties 
of its absorption spectrum. Quartz absorbs strongly in the ultra-violet (from 
approximately 0-15 1) and in the infra-red (from several 1) and between these 
wavelengths it is transparent. For separations / which lie in the transparent 
region a reasonable estimate of the force # may be obtained by taking / to be 
small compared to A at the right-hand boundary and large compared to A at the 
left-hand boundary of the region. The contribution of the ultra-violet 
absorption band to F can be estimated from (4.22) by putting <y9= €y9= 
(€3)= 1) equal to the square of the refractive index in the transparent optical 
region. The contribution of the infra-red region is given by (4.18); in order of 
magnitude it is less by a factor lw/c (wo is the infra-red absorption frequency). 
Thus to estimate the attractive force we can use (4.22) with the optical (instead. 
of the electrostatic) value of the dielectric permeabilities substituted for <,. This 
estimate is low for larger separations and high for smaller ones. 
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Thus at room temperature the correction term is small for /< 5 x 10cm; 
comparison with the criterion derived from (4.27) shows that there 
exists a region of validity of the formula (4.21). 

When Jk7'/hc > 1 we need to keep only the first of the terms of the sum 
(4.13). However, we cannot immediately put n=0 because of the in- 
determinacy which this introduces (the factor €,° tends to zero, but the 
intergal over dp diverges). This difficulty can be avoided by first replacing 
p by anew variable of integration # = 2pé,,11/ €g9/¢ (as a result of which the 
factor €,° disappears). If we then put €,=0 we find 


= smal "a | fue elie athe — 1] cans 
1671? (€19 — €30)(€20 — €30) 
my JT (£10 =. €50)\ £90 Fad) a 
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Thus at sufficiently large distances the decrease in the interaction slows 
down and again goes as /-* with a coefficient which now depends both on 
the temperature and on the electrostatic value of the dielectric permea- 
bilities. 
All the other terms of the sum (4.13) decrease exponentially for large 
IkT /he. Thus, for two metals separated by a vacuum, the corrected force is 


kT AnkTl\ 2 4xkTl 
c= ieee) yf fe.) _— . mht 5 : 
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4.4. Interaction of Individual Atoms 


We now show how it is possible to go from the macroscopic formula 
(4.14) to the interaction of individual atomsin a vacuum. To do this we 
formally suppose both bodies to be sufficiently rarefied. From the point 
of view of macroscopic electrodynamics this means that their dielectric 
permeabilities are close to 1, i.e. the differences «,—1 and e«,—1 are small. 

We begin with ‘smalldistances. From (4.15) with «,=1 we have with 
the necessary accuracy : 


h ie) roe) 
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Expressing ¢(/£) in terms of e"(w) along the real w — axis according to (3.11) 
we have 
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whence for the force F we find 


Pam |. i tens ey 1 dw». - . (4,32) 
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This force corresponds to an atomic interaction with an energy} 
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where Ff is the distance between the atoms, NV 1 NV, are the numbers of 
atoms per unit volume in the two bodies. The imaginary part of the 
dielectic permeability is related to the spectral density of the ‘ oscillator 


strengths’ f(w) which are known from spectroscopy by the equation 
i 
= N. 
7 Nfl) 


we" (w)= 
(see, for example, Landau and Lifshitz, 1960, §62). Substituting this in 
(4.33) we find 
3het Pele (Or) 135) 
U(R) = — ee oye: . (4.34 
m spel 0 Myw(w1 + wy) nei 
This expression is identical with the well-known London formula (1930) 
which was obtained by ordinary perturbation theory applied to the dipole 
interaction of two atoms. For example, suppose we are considering the 
interaction of two hydrogen atoms. We use the expression 


2m 
fon= ae (L,, — Ey)|%on|? 


for the oscillator strength for the transition between the states H,, and EH, 
(Xp, is the corresponding matrix element of the coordinates of the electron 
in the atom), and when we transform from an integration over frequencies 
to a summation over the energy levels of the atom we get the London 
formula for hydrogen atoms : 
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Thus we see how this ‘microscopic’ formula is reproduced from a purely 


macroscopic theory. 
For ‘large’ distances the formula for the attractive force between two 


rarefied bodies takes the form 
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+ If the potential energy of the interaction between molecules 1 and 2 is 
— —ak-*, the total energy of the interactions in pairs of all the molecules in 
the two half-spaces separated by a gap / is 
U = —anN,N,/1202. 
The force F is 


This explains the correspondence of eqns. (4.32) and (4.33) 
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This force corresponds to the interaction of two atoms with an energy 
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where a, «, are the static polarizabilities of the atoms (¢9=1+47Na). 
Formula (4.36) coincides with the result of the quantum mechnical cal-— 
culation by Casimir and Polder (1948) of the attraction between two 
atoms when the distance between them is sufficiently large for retardation 
effects to be important. 

Similarly, by considering just one of the bodies, say body 2, as a rarefied 
medium, we can find the interaction between individual molecules and a 
condensed body. Thus, for a molecule which is at a ‘large’ distance / 
from the surface of the body we find the following formula for the inter- 
action energy 


BhCax» €y9— | 
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(4.38) 


The function ¢,, is given graphically in fig. 10. For e+ oo it tends to 1, 
while for «1 it tends to 23/30=0-77. The expression for €,)— © 
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(4.39) 


coincides with the results of Casimir and Polder (1948) for the interaction 
energy of an atom with a metal wall. 

We now consider the interaction of two atoms in a liquid (Pitaevskii 
1959). Wesuppose that both bodies are weak solutions in the same solvent 
of atoms of different kinds with concentrations (numbers of particles per 
em’) V, and Ny. We also suppose that the gap is filled with pure solvent. 
For small concentrations of dissolved atoms the dielectric permeabilities 
€,, €, of the solutions differ little from that of the pure solvent, which we 
denote by «,=e. ‘To first order in the concentration we have - 


Ty 0 Ty 2 
a=etN, (4) : a=, (52) “ 
“"1/ N,=0 4"2/ N.z=0 


From the formula (4.15) for the force at ‘small’ distances we retain 
only those terms of the same order of smallness, and (analogously to the 
transform to (4.31)) we find: 


F(l) = aoa Ns N, .| “(r) (S00) ae 
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This force corresponds to an interaction energy of the dissolved atoms 


equal to 
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Analogously the energy for ‘large’ distances is found to be 


U(R) = — panera (532) (53) ae sates. 1) 
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We see that.when the dissolved molecules interact strongly with the 
solvent the interaction forces between them are no longer determined by 
their polarisability. 

The interaction of small spherical particles in a liquid is another inter- 
estingexample. Let both bodies be such an emulsion, made up of spherical 
particles of volume » with dielectric permeability e«’ in a liquid with 
dielectric permeability «. As before the gap is filled with pure solvent. 
For Nv<1 (N is the number per unit volume) the dielectric permeability 
of the emulsion is of the form 


(<’—e)e 


€y=e€,=€+3Nv 
e +Ze 


(see, for example, Landau and Lifshitz, 1960, §9). By using the fact that 

€,—e and «,—e« are both small we find by a similar method to that used 

above the following expressions for the interaction energy of the particles 
of the emulsion : 

2The® =f @T e(2€)— e(1é) |? 
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U(R)= — aye 
Ey + 2€ 
The dimensions of the particles themselves must be small compared to 
their separation R (but not necessarily compared to Aj). 


§ 5. A Tain FIuM oN THE SURFACE OF A SOLID Bopy 
5.1. The Chemical Potential of the Film 


The general theory of van der Waals forces developed above can also 
be used for the calculation of the thermodynamic quantities of a thin 
liquid film which lies on the surface of a solid body; the thickness / of the 
film is of course supposed to be large compared to the interatomic distances, 

Equation (3.25), which was derived above, expresses the chemical 
potential per unit mass of a liquid in terms of the Green functions of the 
relevant fluctuations of the electromagnetic field. However, this formula 
is inconvenient for two reasons: first, it contains over the whole frequency 
range the quantity 0¢«/@p which has not been studied at all experimentally ; 
and second, it gives the chemical potential ¢ as a function of the density 
p, while it is usually necessary to know ¢ as a function of the pressure p. 
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We consider a film 3 lying on the surface of a solid body 1 and in equil- 
brium with its vapour 2 (fig. 11). As far as its electromagnetic properties 
are concerned, we shall treat the vapour as a vacuum, i.e. we shall always 
take its dielectic permeability «, to be equal to 1. 

According to the condition of mechanical equilibrium, the normal 
component o,,,, of the stress tensor must be continuous at the surface of the 
film. Hence we find the equation 

p=Ppolp, T)—Grz 
where p is the vapour pressure, p(p, 7’) is the pressure of the bulk at the 


given density and temperature, and 6,,, denotes the sum of all the terms 
except the first in the expression (3.24) for the stress tensor in the film. 


Fig. 11 


By solving this equation for p we find the density in the form} 


P= Pol P+ Fx_ 1). 
Substitution of this expression in eqn. (3.25) for the chemical potential 
gives 


[ie 
=6(p+denT)+ — >’ wA(r,; &,) 


47 n=0 Op 
where now ¢)(p, 7’) is the chemical potential of the bulk liquid. We 
expand ¢, in powers of the small quantity ¢,,, and using the thermo- 
dynamic relation (0¢/0p),,=1/p we find 

Up, T)=L(p, T)+ 2 Get — >! Be. 

Pp 4a n=0 Op 
Finally, by substituting here the expression for é,,, from (3.24), we find 
that the term in de¢/dp drops out and we are left with 


C(p, 2) ors Co(P, T’) + (1/p)on,' 
where o,,,' is a component of the ‘contracted’ stress tensor (4.2). This 
quantity is constant across the film (because the momentum flux is 
constant), and the force F(/), by (4.3), is directly determined by it. 
We introduce the notation j. for the ‘van der Waals part’ of the chemical 
potential of the film per unit volume of the liquid: 


C= Cy {pie rete. eens (5.1) 
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} Gzz 18 also a function of p, but since it is a small correction to the pressure 
we shall put p=pp(p, 7'). 
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From the above argument we have 
a Cae! = Fil). 
As | tends to ©, i.e. for the bulk liquid, » tends to zero. 

Thus we do not need to carry out any new calculations in order to 
determine the quantity ». It is determined by the formulae obtained 
above for F(l) (the general formula (4.13)) and the limiting cases considered 
in §§4.2, 4.3) simply by putting «,=1. 

The function (7, /) determines all the thermodynamic quantities of 
the film. So, if the film is in equilibrium with vapour at pressure p, the 
condition that the chemical potentials of the liquid and the vapour should 
be equal gives the well known equation 


eae 

an PE 

where m is the mass of a molecule, and psat is the saturated vapour pressure 

above the surface of the bulk liquid at the given temperature 7. The 

equilibrium thickness of the film is determined by this equation.+ 

If we consider a liquid film under gravity on a vertical wall, then 

P=Dsat exp (—mgz/kT') (where z is the height above the level of the liquid 
in the vessel) and from (5.2) we have the equation 


(5.2) 
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which determines the profile of the film, i.e. its thickness as a function of 
height. 
For ‘small’ (in the sense of § 4.2) film thicknesses we have the limiting 


behaviour (cf. 4.18) 
h@ ‘ ® (a= Iles= 1) g 
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For ‘large’ thicknesses j.(/) is proportional to /~* with a coefficient deter- 
mined (in accordance with (4.20)) by the electrostatic dielectric constants 
of the film (¢€3)) and of the solid body (e,9); the sign of » is the same as 
that of €g)—«;){. The function »(/) may change sign and be non-monotonic 
(cf. the similar note on the force F(/) on page 192). Non-monotonic 
behaviour of ,(/) in a certain range of values of / is in general related to a 
change in sign of the differences ¢,(i€)—«,(7€) in the wavelength range 
A~l. 
As well as the potential py it is convenient to use the ‘effective surface 
tension coefficients’ « at the boundary of the solid body and the vapour 2 


+ In the deduction of (5.2) the formula 
kT p 
Cvap = So (Psat, Z’) + uae 
for the chemical potential is used, and the liquid is taken to be incompressible, 
ie. we neglect the dependence of the chemical potential of the bulk liquid on 


ressure. 
: + Provided the important dispersion of the dielectric constant is not already 


apparent (as it is for water) at very long wavelengths. 
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to describe the properties of the film. This coefficient takes into account 
the existence of a liquid layer betewen them. This can be done formally 
by using the absorption theory relation 


ae 
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where y is the surface concentration of adsorbed particles (the number 
per em2), ¢’ is the chemical potential per particle (see, for example, Landau 
and Lifshitz (1958), §144). For the definition of » taken here (the liquid 
is considered to be incompressible) this expression is written in the form 


i= (5) ol et eRe Geen 


which is applicable for both macroscopically thick ‘wetting’ films and 
adsorption films of ‘molecular thickness’: in the latter case, of course, 1 
has only an indirect interpretation as a quantity proportional to the 


Fig. 12 
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surface concentration (/=my/p, m is the mass of a molecule). Integrating 
(5.5) and using the fact that as /-> oo the function a(1) must tend to a43 + a59, 
the sum of the surface tensions at the boundaries of the phases 1, 3 and 3, 
2, we have 
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We also note that the necessary condition for the thermodynamic 
stability of the film is that the inequality 


0 
(5) ,>° ee ahs gE Sie, Fe peas 
should be satisfied. 


If (5.2) is satisfied by several values of J, the stable state of the film 
corresponds to that for which « is a minimum; higher values of « then 
correspond to metastable states. 
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We consider several typical cases which can arise for various functions 


p(2). 


(a) If w(/) is a monotonically decreasing function which is always positive 
(fig. 12 (a)), the liquid will not wet a solid surface and no film will be formed. 
We emphasize that we are now considering only macroscopically thick 
films, to which all the theory developed here refers. As far as adsorption 
in the narrow sense of the word is concerned, this, as is known, always 
takes place to a certain extent. This corresponds to the fact.that, what- 
ever the variation of the function ,(/) in the range of molecular dimensions 
(not shown in fig. 12), it must eventually tend to — oo for 10 as p~ Inl 


which corresponds to a ‘weak solution’ of adsorbed molecules on the 
surface. 


(6) Ifu(/) increases monotonically and is everywhere negative (fig. 12 (b)), 
this corresponds to a liquid which completely wets a solid surface and 
forms (depending on the vapour pressure above it) a stable film of any 
thickness. In particular, the thickness of a film formed on a vertical wall 
tends to zero as z-> 00; the decrease initially goes as 1~z—/4, and then as 
Twz-ds, 

However, in this case too the liquid can be non-wetting if the behaviour 
of w(l) in the microscopic range leads to smaller values of the surface 
tension xf; then molecular adsorption and not a wetting film will be stable. 


(c) (1) passes through zero and has a maximum as shown in fig. 12 (c). 
With the same proviso as in (6) wetting will occur, but with the formation 
of a film which is stable only for thicknesses less than a certain limit. In 
equilibrium with the saturated vapour there will be a film of finite thickness, 
corresponding to the point A. This state is separated from the other 
stable state—the equilibrium of the solid body with the bulk liquid—by a 
metastable region AB and an unstable region BC. 


Fig. 13 
7 
2 £ 
g 3 a ps 
~ 1 — J a 


A curve j(1) of this type must lead to interesting features in the formation 
of an angle of contact @ between the liquid drop and the solid body. In 
this case the drop isin equilibrium with a film of a finite thickness /,,,,. (fig. 13 
and by the usual elementary formula we have 


cos0 = o(lmax) ~ “as hE a eae O8) 
O98 
where «(J,,,,) (with o(/) from (5.6)) plays the role of the surface tension 


Ck ES EE EE a SR ee 


+ This could be a high peak in p(/) in the molecular range of ‘ thicknesses ’. 


P.M.S. Q 
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between phases 1 and 2. Since the first term in (5.6) is small, (5.8) gives 


pie 2 et pe A) ee 
bax dl Mog 


By interpolating between .~/-* and » ~/~ we can get an estimate for 


1 ate) 
6~ — eee ie emer aeke ex lk Eh 
ie af Ge 


with @ taken from (5.4). Thus, for f@~10ev, a3~20erg/cm?, 
Lnax~ 2 x 10-5cm, we get @~0-1°. 

Thus in this case the angle of contact must be finite but it is very small 
(as distinct from the value @=0 for complete wetting and @~1 for the 
usual cases of non-wetting). It is understood that this type of behaviour 
can be observed only when the thickness of the drop is large compared to 
that of the film, i.e. we must have L@>1,,,,, where L is the size of the drop 
(fig. 13). 

(d) A curve of the type shown in fig. 12 (d)) corresponds to a film which 
is unstable over a certain range of thicknesses. The straight line BF, 
which cuts off equal areas BCD and DEF, joins the points B and F which 
have (for equal ;.) equal values of a (as can easily be seen from (5.6)). The 
branches AB and FG correspond to stable films;-the interval CE is un- 
stable, and the intervals BC and EF are metastable. 

In this case both boundaries of the range of instability (points B and F) 
correspond to macroscopic film thicknesses. Instability over the range 
from some macroscopic thickness to a molecular layer would correspond 
to the curve shown in fig. 12 (e) (for / > 0 this, as does that in fig. 12 (a), 
tends to — oo). Actually, however, such a curve would be more likely to 
lead simply to non-wetting. The boundary of stability would correspond 
to a point on the branch BC such that a horizontal line through it would 
cut off equal areas under the upper and above the lower parts of the curve 
However, the latter area related to the van der Waals forces, would be | 
small compared to the former, because of the considerably greater forces 
at molecular distances. This means that the surface tension over the 
whole of the branch BC will be greater than that which corresponds to 
molecular adsorption on the surface of the solid body, and therefore the 
film will be metastable. 


Ima x 


5.2. Non-electromagnetic Forces 


As has already been noted at the beginning of §4.2, forces of non- 
electromagnetic origin, as well as van der Waals forces, make a certain 
contribution to the chemical potential of the film, but this contribution is 
usually small. We now quote the corresponding estimates without going 
through the calculations in detail. 

At absolute zero acoustic fluctuations (in an acoustically non-dispersive 
medium) give a contribution to the chemical potential 


Hao~ hull! 
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where wu is the velocity of sound}. This is to be compared to the electro- 
magnetic part p.,,, ~~ he/l4 for 1> A, or ju, ~ he/I3A, for 1<A,. It is clear that 
Lac> Mim for all distances large compared to atomic dimensions, which is also 
a condition that the theory developed here should be valid. 

For non-zero temperatures the other limiting case for [,, usually holds, 
i.e. the influence of temperature is predominant. The corresponding 
criterion is the value of the ratio 1kT'/hu. The condition ikT/hu> 1 (as is 
the condition [kT'/fc>1 in the electromagnetic case) is essentially the 
condition for classical behaviour (iw <kT' with w~l/u or w~I/c). It is 

therefore clear that the contribution to » need not contain #,and hence from 
dimensional considerations it is clear that 
Hao ~ RDP 
(cf. eqn. (3.19)). This is comparable with j.,,, only for | ~ tc/kT which is a 
distance so large that y itself has become very small. 

The same applies to the contribution from the surface vibrations. The 
dependence of frequency on wave vector x for capillary waves on the 
surface of a liquid layer of depth / is given by the well-known formula 


ors aK thel 
p 
where « is the surface tension (see, for example, Landau and Lifshitz 
(1959), §61); for a deep layer of liquid (J—> 00) w?=ax/p. Calculating the 
energy of the zero point vibrations (with the subtraction of this energy as 
1+ oo), we find that at absolute zero the corresponding contribution to the 


chemical potential is 
en a 
eur ~ poi2 p : 


In fact, however, the other limiting case is valid, i.e. the classical con- 
dition (4/kT)\/(a/p)l-? <1 applies; on general statistical grounds calcu- 
lation gives a contribution of the same order of magnitude p,,,~ kT'/1? as 
in the acoustic caset. 

To explain the properties of helium films other mechanisms related to 
inhomogeneities in the distribution of liquid density across the film have 
been proposed by several authors. In its crudest form the corresponding 


+ This expression is analogous to »~fc/l* obtained for the electromagnetic 
contribution (in a non-dispersive medium). This can be derived by, for | 
example, summing the energy of the zero point acoustic vibrations in the gap 
(of width 1) similarly to the method used by Casimir (1948) for the zero point 
electromagnetic vibrations. We note that Atkins’ (1954) result, which pre- 
dicted that 1a, should depend on the film thickness as [-*, was due to an in- 
correct cut-off of a divergent integral. 

+In all this we are making literal estimates, but it should be remembered 
that really the expressions for guy and face contain (as is shown by a more 
detailed analysis) numerical coefficients which are as small as those which 
appear in the electromagnetic part fem. The appearance of comparatively 
small numerical coefficients is a general characteristic of the theory developed 


here. 
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calculation treated the helium in the film as an ideal gas whose particle 
wave functions have nodes at the wall and at the surface of the film. This 
model leads to a sharply inhomogeneous density distribution with a maxi- 
mum in the centre, and to a contribution to the chemical potential which 
is proportional to /-2. This treatment is however quite inadmissible (as 
was pointed out by Mott (1949)), since the interaction between the atoms 
smooths out the wave function of the ground state of the system and the 
density inhomogeneities extend (in the body of the liquid) only over 
distances of the order of the atomic separation. The contribution to the 
chemical potential from this inhomogeneity decreases exponentially as — 
the film thickness is increased. 

The same dependence on film thickness applies to the contribution from 
the specific properties of helium below the A-point (superfluidity). Itis only 
in the immediate neighbourhood of the A-point, where the density of the 
superfluid component is very small, that the inhomogeneity in the distri- 
bution of the latter produces any noticeable effects (see Ginsburg and 
Pitaevskii 1958). Even at 0-01° from the A-point the decrement of the 
exponential decay becomes comparable to the interatomic distance. The 
result of Franchetti (1957), who derived a contribution to the chemical 
potential which was proportional to/-*, is due to the inadequacy of the model 
of non-interacting elementary excitations in helium which he used. 


5.3. Films of Inquid Helium 


We consider in particular liquid helium films, on which there is already an 
extensive literature. 

For helium films the general formula (4.14) can be considerably simplified 
by noting that the dielectric permeability of liquid helium is very close to 
unity, i.e. the difference e, (1€)— lissmall. Carrying out the corresponding 
expansions in (4.14) we get : 


= A Raa y be ota way yfa 
A oe ik eee *P errs c 3 


exp (— 2pél/c) dp dé, 

8 = V/[e,(i€) —14 p?]. a AN Cys ho ee OED 

However, calculation even from this simplified formula is hindered by the 

need to know the form of the function e(i€) for the liquid helium and for the 

solid wall over a wide frequency range, in particular in the extreme ultra- 

violet: in the integral (5.11) the important range of wavelengths are those 

for which A~1, while in fact the thickness of helium films is of the order of 
10-* cm. 

For further simplification of (5.11) it is a reasonable approximation to use 
the fact that the main absorption band of helium lies in the extreme ultra- 
violet, while those for the solid body of the wall (metals, quartz) are at 
much lower frequencies. In other words we shall assume that the function 
€,(€) practically coincides with the electrostatic value €39 over the whole 
range of variation of € in which «,(i¢)—1 (and with it the whole of the 
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expression under the integral in (5.11)) is not too small. Then e,— 1 can 
be taken in front of the integral sign, and the remaining integral is. trans- 
formed as in the limiting case of small thickness (lis small compared to the 
wavelengths A, in the main absorption band of the solid body). We replace 
p by the variable of integration x=2pél/c and use the fact that values of 
«~ 1 correspond to large values of p to replace the curly brackets in (5.11) 
by 2p?(e,—1)/(e, +1). As a result we find 


_ A@(Ex9 — 1) 


h\= AG ANSE NIE eg) ¢ 
p(l) = — See (5.12) 
where we have used the notation 
= m €,(2E) 11 
oO= ae ad oats . . . . . . ls 
I, e(ig) El eee 


i.e. an average frequency which is characteristic of the given solid body. 

We note that the function [e(w)—1]/[e(w)+1] has the same analytic 
properties in the upper half-plane of the complex variable w as the function 
«(w)—1. This is sufficient to enable us to use the same formula to trans- 
form the integral along the imaginary axis to one along the real axis as was 
valid for the function «(w)—1 (see Landau and Lifshitz (1960), § 62). 
Namely we can write the integral for w in the form 


a= i Teed le Sala ancencat Nie da SEES . (5.14) 
0 €(w) +1 o [ey (w) +1) + [a"(w)? 
where «’(w) and ¢’’(w) are the real and imaginary parts of the dielectric 
permeabilities for real frequencies, i.e. experimentally directly measurable 
quantities. 

Thus, for the actually observed thicknesses of helium film one would 
expect » to vary as [-%, and correspondingly the form of the film profile to 
be l~z—¥8, Calculation of the coefficient in this dependence needs, how- 
ever, a knowledge of the optical properties of the solid body (the wall) over a 
wide range of frequencies. We emphasize that the calculation of this 
coefficient on the basis of data on the interaction of individual helium atoms 
with a solid wall is in any case inadmissible. 

We also give the expression for , for ‘large’ film thicknesses (>A). 
The corresponding transform from (5.11) is made by introducing the 
variable x = 2pél/c instead of & and by replacing €, by 4). The integrations 
over both dx and dp are carried out analytically, and as a result we have 

3hC(€39 — 1) eyo — 1 

p(l) = — ee ae 5 Saale) = > + (5.15) 
with the function ¢,4 taken from (4.38). For a metal €,)> © and $,4=1. 
For quartz, which is transparent from ~ 0-15 1 to several yu, it is also mean - 
ingful to consider the case when / lies within this region. The corres- 
ponding dependence ,(2) is found from the same formula (5.15), but eq 
does not refer to the electrostatic dielectric constant but to the optical value 
c,, i.e. the square of the refractive index in the transparent optical region 
(cf. note on page (195). 
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Equations (5.11), (5.12), (5.15) do not contain the temperature, i.e., 
strictly speaking they refer to absolute zero. However, the corrections for 
temperature must be relatively small, and there are no grounds for 
expecting any material change in the form of the profile of the film as the 
temperature is changed, either below or above the A-point (outside its 
immediate neighbourhood). 

The difficulties of experimental observation of the thickness and profile 
of a helium film under conditions which are sufficiently near to the ideal of 
thermal equilibrium are very great, and it is only very recently that they 
have been overcome to the extent that the data (in the helium II region) 
could be considered at all reliable (see the reviews by Jackson and Grimes 
(1958) and by Atkins (1957). 

In accordance with what was said in $5.2, there is no physical basis for 
expecting a film profile of the form 

pgz ~ al + bI-*. 

Anderson et al. (1960) point out that their data on the thickness of a 
helium film on a steel surface (up to a height of 40 em) are well described by a 
law of the form pgz=al-? (the absolute values of the thicknesses are not. 
given). 

The same law accounts for the results of the measurements by Ham and 
Jackson (1957) and by Grimes and Jackson (1959) (for heights of 0-4 to 
7 em) with a coefficient az 4:5 x 10-145erg. A comparison of this value with 
the coefficient of [-3 in eqn. (5.12) (putting «5, — 1= 0-057) gives i@ ~ 7-5ev. 
This is a reasonable value for a metal (steel). 

The coefficients in (5.12) and (5.15) (for €,,— 00) are equal when / = 3c/2a, 
i.e. in this case for 1~5x10-*cm. This means that the experimentally 
observed film thicknesses (100-400A) are near the transition region 
between the /-° and /-* dependences. 
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